Course Ally

A command-line tool for extracting and processing YouTube content. Generate lectures and transcripts, and organize video links, from YouTube videos and playlists using OpenAI's APIs.

Requirements

Python 3.8 or higher
OpenAI API key: set it in the OPENAI_API_KEY environment variable, for example: export OPENAI_API_KEY=your_api_key

FFmpeg: Required for audio processing and chunking large files

# Ubuntu/Debian
sudo apt install ffmpeg

# macOS
brew install ffmpeg

# Windows
# Download from https://ffmpeg.org/download.html

Install dependencies:
```
pip install -r requirements.txt
```
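
To confirm the requirements above before the first run, a small check along these lines may help. It is only a sketch: the function name `missing_prerequisites` is hypothetical and not part of Course Ally.

```python
import os
import shutil
import sys


def missing_prerequisites(env=None):
    """Return a list of human-readable problems; empty if everything looks fine."""
    env = os.environ if env is None else env
    problems = []
    if sys.version_info < (3, 8):
        problems.append("Python 3.8 or higher is required")
    if not env.get("OPENAI_API_KEY"):
        problems.append("OPENAI_API_KEY is not set")
    # shutil.which returns None when the executable is not on PATH.
    if shutil.which("ffmpeg") is None:
        problems.append("ffmpeg was not found on PATH")
    return problems


if __name__ == "__main__":
    for problem in missing_prerequisites():
        print(f"Missing: {problem}")
```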

Features

🎓 Create Lecture

Generate structured lecture content from a single YouTube video.

python main.py create-lecture <VIDEO_ID> [OPTIONS]

Options:

--output, -o: Save location for markdown file
--sections, -s: Number of main sections (default: 3)

What it does:

Downloads and converts video to audio
Transcribes audio using OpenAI Whisper
Generates structured lecture content with sections and key points
Outputs clean markdown format
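
As an illustration of the final step, the sketch below renders sections and key points as markdown. The `render_lecture` function and the `(heading, key_points)` data shape are assumptions made for this example; the tool's actual internal format may differ.

```python
def render_lecture(title, sections):
    """Render a lecture as markdown.

    `sections` is a list of (heading, key_points) pairs. This shape is an
    assumption for illustration, not the tool's internal format.
    """
    lines = [f"# {title}", ""]
    for heading, key_points in sections:
        lines.append(f"## {heading}")
        lines.append("")
        lines.extend(f"- {point}" for point in key_points)
        lines.append("")
    # Collapse trailing blank lines into a single final newline.
    return "\n".join(lines).rstrip() + "\n"


if __name__ == "__main__":
    print(render_lecture("Intro", [("Basics", ["First point", "Second point"])]))
```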

🎬 Extract Transcripts

Smart transcription that automatically detects video vs playlist URLs.

python main.py extract-playlist-transcripts <YOUTUBE_URL> [OPTIONS]

Options:

--subfolder, -s: Optional subfolder in outputs/transcripts
--format, -f: Output format (txt/json, default: txt)
--max-workers, -w: Parallel workers for playlists (default: 4)

What it does:

Smart URL Detection: Automatically recognizes video vs playlist URLs
Single Videos: Downloads, transcribes, and saves with metadata
Playlists: Processes videos in parallel (4 workers by default, configurable with --max-workers)
Formatted Output: One sentence per line for better readability
Large File Support: Automatically chunks audio files >25MB
Unified Interface: No need to choose between video/playlist commands
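
The "one sentence per line" formatting could be approximated as below. This is a naive sketch with a hypothetical helper name. It splits after `.`, `!`, or `?`, so abbreviations such as "e.g." would also trigger a line break, and the tool's own formatter may be more careful.

```python
import re


def one_sentence_per_line(text):
    """Put each sentence of a transcript on its own line (naive splitter)."""
    # Collapse newlines and repeated spaces into single spaces first.
    normalized = " ".join(text.split())
    # Split on whitespace that follows sentence-ending punctuation.
    sentences = re.split(r"(?<=[.!?])\s+", normalized)
    return "\n".join(s for s in sentences if s)
```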

Supported URL formats:

https://www.youtube.com/watch?v=VIDEO_ID (single video)
https://youtu.be/VIDEO_ID (single video)
https://www.youtube.com/playlist?list=PLAYLIST_ID (playlist)
https://www.youtube.com/watch?v=VIDEO_ID&list=PLAYLIST_ID (playlist)
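
The four formats above can be told apart with the standard library alone. The sketch below uses a hypothetical `classify_youtube_url` helper and is not necessarily how Course Ally detects URL types. It treats any youtube.com URL with a `list` parameter as a playlist, matching the table above.

```python
from urllib.parse import parse_qs, urlparse


def classify_youtube_url(url):
    """Return "video", "playlist", or "unknown" for a YouTube URL."""
    parsed = urlparse(url)
    host = parsed.netloc.lower()
    if host.startswith("www."):
        host = host[4:]
    query = parse_qs(parsed.query)

    if host == "youtu.be":
        # Short links carry the video ID in the path.
        return "video" if parsed.path.strip("/") else "unknown"
    if host == "youtube.com":
        if "list" in query:
            return "playlist"
        if parsed.path == "/watch" and "v" in query:
            return "video"
    return "unknown"
```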

🎤 Transcribe Local Audio Files

Transcribe audio files that are already stored locally on your system.

python main.py transcribe-local-audio [OPTIONS]

Options:

--subfolder, -s: Optional subfolder in outputs/transcripts
--format, -f: Output format (txt/json, default: txt)
--max-workers, -w: Parallel workers (default: 4)

What it does:

Interactive Subfolder Selection: Browse and select from available audio subfolders
Flexible File Selection: Process all files or select specific ones
Parallel Processing: Transcribe multiple audio files simultaneously
Large File Support: Automatically chunks audio files >25MB
Format Support: mp3, wav, m4a, ogg, flac
Progress Tracking: Real-time updates for each file being processed

How to use:

Place your audio files in outputs/audios/[subfolder_name]/
Run the command
Select the subfolder containing your audio files
Choose to process all files or select specific ones
Transcripts are saved to outputs/transcripts/[subfolder_name]/

Example directory structure:

outputs/audios/
├── pba/                    # Your audio subfolder
│   ├── lecture1.m4a
│   ├── lecture2.m4a
│   └── lecture3.m4a
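
A sketch of how the files in a subfolder could be discovered and mapped to transcript paths, using the supported extensions and directories listed above. The helper names are hypothetical, and naming each transcript after the audio file's stem is an assumption rather than documented behavior.

```python
from pathlib import Path

SUPPORTED_EXTENSIONS = {".mp3", ".wav", ".m4a", ".ogg", ".flac"}


def find_audio_files(subfolder, root="outputs/audios"):
    """Return supported audio files in outputs/audios/<subfolder>, sorted by name."""
    folder = Path(root) / subfolder
    if not folder.is_dir():
        return []
    return sorted(
        p for p in folder.iterdir()
        if p.is_file() and p.suffix.lower() in SUPPORTED_EXTENSIONS
    )


def transcript_path(audio_path, subfolder, fmt="txt", root="outputs/transcripts"):
    """Assumed naming: same stem as the audio file, with the chosen format."""
    return Path(root) / subfolder / f"{Path(audio_path).stem}.{fmt}"
```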

🔗 Extract Playlist Links

Extract and organize all video links from a YouTube playlist into markdown.

python main.py extract-playlist-links <PLAYLIST_URL> [OPTIONS]

Options:

--subfolder, -s: Optional subfolder in outputs/yt_links
--live-format, -l: Use live embed format instead of regular links

What it does:

Extracts playlist metadata and video information
Generates organized markdown with playlist summary
Creates properly formatted video links:
- Regular format: ![lecture](video_url)
- Live format: <liveUrl>https://youtube.com/embed/video_id</liveUrl>
Includes video titles, durations, and playlist description
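
The two link formats can be produced as sketched below. The function name is hypothetical, and using the standard watch URL for `video_url` in the regular format is an assumption.

```python
def format_video_link(video_id, live_format=False):
    """Format one playlist entry in the regular or live embed style."""
    if live_format:
        return f"<liveUrl>https://youtube.com/embed/{video_id}</liveUrl>"
    # Assumption: the regular format points at the standard watch URL.
    return f"![lecture](https://www.youtube.com/watch?v={video_id})"
```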

Output Structure

All outputs are organized in the outputs/ directory:

outputs/
├── audios/              # Local audio files for transcription
│   ├── subfolder/       # Organize by project/topic
│   └── *.mp3/*.m4a      # Audio files (mp3, wav, m4a, ogg, flac)
├── transcripts/         # Video/audio transcriptions
│   ├── subfolder/       # Optional subfolders
│   └── *.txt/*.json     # Transcript files
└── yt_links/            # Playlist link exports
    ├── subfolder/       # Optional subfolders
    └── *.md             # Markdown link files

Interactive Mode

Run without arguments for guided setup:

python main.py

The interactive menu will guide you through:

Feature selection
Input collection (URLs, options)
Configuration (workers, formats, etc.)

Advanced Features

🚀 Parallel Processing

Process multiple videos simultaneously (4 workers by default)
Configurable worker count for optimal performance
Thread-safe progress reporting
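
One common way to combine a configurable worker pool with thread-safe progress output is a `ThreadPoolExecutor` plus a lock-guarded counter. The sketch below shows that pattern under assumed names (`Progress`, `process_in_parallel`); it is not taken from the project's source.

```python
import threading
from concurrent.futures import ThreadPoolExecutor


class Progress:
    """Counter whose updates and prints are serialized by a lock."""

    def __init__(self, total):
        self.total = total
        self.done = 0
        self._lock = threading.Lock()

    def advance(self, label):
        with self._lock:
            self.done += 1
            print(f"[{self.done}/{self.total}] {label}")


def process_in_parallel(items, worker, max_workers=4):
    """Run worker(item) for each item; results keep the input order."""
    items = list(items)
    progress = Progress(len(items))

    def run(item):
        result = worker(item)
        progress.advance(item)
        return result

    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(run, items))
```

Threads suit this workload because downloading and API calls are I/O-bound, so the workers spend most of their time waiting rather than competing for the interpreter.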

📦 Large File Handling

Automatic chunking for audio files >25MB
Smart duration-based splitting using FFmpeg
Seamless chunk transcription and recombination
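
Duration-based splitting can be planned from the file size and total duration, as in the sketch below. The function names are hypothetical, and the plan assumes a roughly constant bitrate, so a chunk of variable-bitrate audio could still exceed the limit. The command list uses standard FFmpeg options (`-ss`, `-t`, `-c copy`), but the project's actual invocation may differ.

```python
import math

MAX_CHUNK_BYTES = 25 * 1024 * 1024  # the 25MB threshold mentioned above


def plan_chunks(file_size_bytes, duration_seconds, max_bytes=MAX_CHUNK_BYTES):
    """Return (start, length) pairs in seconds covering the whole file."""
    if file_size_bytes <= max_bytes:
        return [(0.0, float(duration_seconds))]
    count = math.ceil(file_size_bytes / max_bytes)
    length = duration_seconds / count
    return [(i * length, length) for i in range(count)]


def ffmpeg_chunk_command(source, start, length, target):
    """Build an ffmpeg command that copies one time slice without re-encoding."""
    return [
        "ffmpeg", "-y",
        "-ss", f"{start:.3f}",   # seek to the chunk start
        "-t", f"{length:.3f}",   # keep this many seconds
        "-i", source,
        "-c", "copy",
        target,
    ]
```

Each chunk would then be transcribed separately and the resulting texts joined in order.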

🎯 Smart Resumption

Skips already processed videos automatically
Continues interrupted playlist processing
Maintains consistent file naming
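
Resumption of this kind usually relies on predictable output names: if the expected transcript already exists, the item is skipped. The sketch below assumes files are named after the video ID, which is an illustrative choice and not confirmed by this README.

```python
from pathlib import Path


def pending_items(video_ids, output_dir, fmt="txt"):
    """Return the video IDs that do not yet have a transcript file."""
    output_dir = Path(output_dir)
    return [
        vid for vid in video_ids
        # Assumed naming scheme: <video_id>.<fmt> inside the output folder.
        if not (output_dir / f"{vid}.{fmt}").exists()
    ]
```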

Contributing

Contributions are welcome! Please open an issue or submit a pull request.

You can tip me via LN through this link.

License

This project is licensed under the MIT License. See license.md for details.
