Detecto is an AI-powered object detection system that identifies and locates multiple objects in real-time video streams. It uses state-of-the-art deep learning models to deliver fast, accurate, and reliable object detection while ignoring person detections for privacy purposes.
- 🚀 Real-time object detection using DETR (DEtection TRansformer) model
- 🔒 Privacy-focused: Automatically ignores person detections
- 🎥 Live camera feed processing with bounding boxes and labels
- ⚡ Optimized for performance with multi-threading
- 🖼️ Image and webcam snapshot testing capabilities
- 🧠 91 object classes detection (excluding persons)
- 🖥️ Cross-platform compatibility
-
Clone the repository:
git clone https://github.com/CrazyBong/Detecto-Object-detection-model.git cd Detecto-Object-detection-model -
Install the required dependencies:
pip install -r requirements.txt
-
Run the application:
python main.py
Run the main application to start real-time object detection:
python main.pyControls:
- Press 'q' to quit the application
The project includes a comprehensive testing suite:
python test.pyAvailable test modes:
- Test on existing image files
- Capture from webcam and test detection
- Quick model verification
The system uses the facebook/detr-resnet-50 model from Hugging Face Transformers, which is based on the DETR (DEtection TRansformer) architecture. This model can detect 91 different object classes with high accuracy.
- Multi-threading: Separates camera capture, object detection, and display into different threads
- Frame skipping: Processes every Nth frame to maintain real-time performance
- GPU acceleration: Automatically utilizes CUDA if available
- Confidence thresholding: Filters low-confidence detections
- Person detection filtering: All person detections are automatically filtered out
- No data storage: No images or videos are saved by default
- Local processing: All processing happens on your device
See requirements.txt for detailed dependencies.
.
├── main.py # Real-time camera object detection
├── test.py # Image and webcam testing suite
├── requirements.txt # Python dependencies
├── README.md # This file
└── .gitignore # Git ignore rules
Contributions are welcome! Please feel free to submit a Pull Request.
This project is licensed under the MIT License - see the LICENSE file for details.
- DETR model by Facebook AI Research
- Hugging Face Transformers library
- OpenCV for computer vision operations