Video Captioning Using Python

Bridging Silence: A Real-Time Sign Language to English Text Translation System Using Python, OpenCV, and Convolutional Neural Networks

Bridging communication gaps between hearing and hearing-impaired individuals is an important challenge in assistive ...

Zero-Shot Audio Captioning Using Soft and Hard Prompts

Abstract: In traditional audio captioning methods, a model is usually trained in a fully supervised manner using a human-annotated dataset containing audio-text pairs and then evaluated on the test ...

IEEE

Enhanced Image Captioning Using CNN and BLIP Models

Abstract: This research focuses on generating image captions using Convolutional Neural Networks (CNN) and Long Short-Term Memory (LSTM) models. As deep learning advances, the availability of large ...

GitHub

mrgoonie/vidcap-mcp-server

This project provides a Model Context Protocol (MCP) server that acts as a proxy to the VidCap YouTube API, allowing AI assistants to easily access YouTube video data and functionalities. It also ...

Microsoft

To Create What You Tell: Generating Videos from Captions

We are creating multimedia contents everyday and everywhere. While automatic content generation has played a fundamental challenge to multimedia community for decades, recent advances of deep learning ...

Variety

Google Removes AI Videos of Disney Characters After Cease and Desist Letter

Google has removed dozens of AI-generated videos that depicted Disney-owned characters after receiving a cease and desist letter from the studio on Wednesday. Disney flagged the YouTube links to the ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results