A Flask web application that generates Markdown documents by processing videos or GitHub repositories as input. The application transcribes speech using OpenAI Whisper, extracts screenshots at moments when specific keywords are spoken, and embeds both the text and screenshots into the Markdown document for seamless documentation.
Demo Video: https://youtu.be/xHDUDiyhM9k
NosuAI provides an efficient solution for generating Markdown documentation. Whether you input a GitHub repository or a video file, NosuAI processes the content to create a Markdown document enriched with text and embedded screenshots. It's ideal for developers, researchers, and content creators looking to automate documentation tasks.
- Markdown Generation: Automatically generates Markdown documents with transcriptions and embedded screenshots.
- Input Options:
- Process GitHub repositories to extract README data and embedded elements.
- Process video files to extract audio, transcribe content, and capture screenshots based on keywords.
- Speech Transcription: Utilize OpenAI Whisper for accurate transcription.
- Screenshot Integration: Generate and embed screenshots at moments when keywords are spoken.
- Secure Development: HTTPS support for local OAuth callbacks.
- Debugging Tools: Comprehensive logging for efficient troubleshooting.
nosuai/
├── app.py # Main Flask application
├── requirements.txt # Python dependencies
├── runtime.txt # Specifies Python version for deployment
├── Procfile # Process declaration for Heroku
├── Aptfile # Additional dependencies for deployment
├── .env.example # Environment variable template
├── apps/
│ ├── routes/
│ │ ├── github_processing.py
│ │ ├── video_processing.py
│ │ ├── create_markdown.py
│ │ ├── github.py
│ │ ├── notion.py
│ │ └── transcription_with_timestamps.py
│ ├── static/
│ │ └── css/
│ ├── templates/
│ │ ├── upload.html
│ │ ├── github.html
│ │ └── notion.html
│ └── utils/
│ └── clean_samples.py
- Python 3.11+
- FFmpeg
- OpenCV
- OpenAI API Key
- (Optional) GitHub & Notion OAuth credentials
-
Clone the repository:
git clone https://github.com/shivamrawat1/nosuai.git cd nosuai -
Create and activate a virtual environment:
python -m venv venv source venv/bin/activate # On Windows: venv\Scripts\activate
-
Install dependencies:
pip install -r requirements.txt
-
Create a
.envfile in the project root:OPENAI_API_KEY=your-openai-api-key FLASK_SECRET_KEY=your-secret-key GITHUB_TOKEN=your-github-hub-token (OPTIONAL to remove rate limits)
-
Start the Flask application:
python app.py
-
Open your browser at https://127.0.0.1:5000.
- Markdown generation from video inputs.
- Transcription and screenshot embedding.
- Full GitHub repository processing with enhanced README generation.
- Support for multilingual transcription.
- Enhanced UI for easier input selection and customization.
Special thanks to the OpenAI, Flask, and developer communities for their tools and resources.