A high-performance, AI-driven video summarization tool built with Google Gemini and Firebase Genkit. This tool allows users to transform long videos into concise, actionable summaries in seconds.
In an era of information overload, watching lengthy videos to find specific information is time-consuming. Summarize-A-Video AI leverages state-of-the-art Generative AI to provide users with immediate value by condensing hours of content into minutes of reading.
- 📝 AI Transcription: Converts speech from video or audio files into accurate text.
- 🤖 AI-Powered Summarization: Deeply analyzes content to extract key points, main themes, and recurring topics.
- 📍 Timestamped Highlights: Automatically generates a timeline of important moments.
- 🌍 Multi-Language Support: Supports summaries in English and Hindi.
- 🔗 URL & File Support: Process local uploads (.mp4, .mov, .avi) or remote YouTube links.
- ⚡ Real-time Processing: Fast analysis using the latest Gemini models.
- Frontend: Vanilla HTML5, CSS3 (Premium Dark Mode), JavaScript
- Backend: Node.js, TypeScript, Express.js
- AI Core: Google Gemini 1.5 Pro / Flash
- Framework: Firebase Genkit
- Deployment: Optimized for Docker & Google Cloud Run
The application follows a modular architecture designed for scalability:
- Core Engine: Handles the integration with Gemini and Firebase Genkit.
- Data Layer: Manages transient video processing states and local caching.
- UI Component: A clean, responsive interface focused on accessibility and speed.
- Node.js: Version 18.x or higher
- API Key: A valid Google AI Studio (Gemini) API Key
-
Clone the repository:
git clone https://github.com/your-username/video-summary-using-ai-tools.git cd video-summary-using-ai-tools -
Install dependencies:
npm install
-
Configure Environment: Create a
.envfile in the root directory:GOOGLE_GENAI_API_KEY=your_api_key_here
-
Run Development Server:
npm run dev
We track daily improvements in our JOURNAL.md. This project follows a philosophy of atomic, meaningful commits to ensure long-term maintainability.
This project is licensed under the ISC License - see the LICENSE file for details.
For questions, support, or collaborations, contact Rishu Ray at rayrishu19@gmail.com.
