# Affomernet

A transformer-based deep learning network for affordance detection.

## Purpose

Affomernet trains a transformer-based detector that produces both object bounding boxes and per-object affordance segmentation masks. For example, it detects a hammer with a bounding box and segments the handle region as a grasping affordance. The framework supports the COCO and IIT datasets, reports mAP for boxes and affordance masks, and can export models to ONNX for deployment.
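To make the "boxes plus affordance masks" output concrete, here is an illustrative sketch of the kind of per-detection result such a model produces. This is not the repo's actual API; the class and field names (`AffordanceDetection`, `label`, `box`, `affordances`) are assumptions for illustration only.

```python
# Illustrative sketch only -- field names are assumptions, not Affomernet's API.
from dataclasses import dataclass, field
from typing import Dict, List

@dataclass
class AffordanceDetection:
    label: str                  # object class, e.g. "hammer"
    box: List[float]            # [x_min, y_min, x_max, y_max] in pixels
    score: float                # detection confidence in [0, 1]
    # affordance name -> binary mask (nested lists stand in for an H x W array)
    affordances: Dict[str, list] = field(default_factory=dict)

det = AffordanceDetection(
    label="hammer",
    box=[120.0, 80.0, 340.0, 210.0],
    score=0.92,
    affordances={"grasp": [[0, 1], [1, 1]]},
)
print(det.label, det.score)  # hammer 0.92
```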
## Table of Contents

- Purpose
- Prerequisites
- Setup
- Install
- Dataset Preparation
- Training
- Validation
- Export to ONNX
- Inference
- Custom Data Training
- Additional Training/Testing Scripts
- Deactivating the Virtual Environment
- Troubleshooting
## Prerequisites

Before setting up the project, ensure you have the following installed on your system:
- Ubuntu (latest version recommended)
- Git
- Anaconda or Miniconda (Download from Anaconda's official website)
- Download the Anaconda installer:

  ```bash
  wget https://repo.anaconda.com/archive/Anaconda3-2023.09-0-Linux-x86_64.sh
  ```

- Install Anaconda:

  ```bash
  bash Anaconda3-2023.09-0-Linux-x86_64.sh
  ```

- Follow the prompts and accept the license terms. After installation, initialize Anaconda by either restarting your terminal or running:

  ```bash
  source ~/.bashrc
  ```

## Setup

Follow these steps to set up the project environment using Anaconda:
Create a new environment with Python 3.11:
conda create --prefix affomernet_env python=3.11conda activate affomernet_envCheck that you're using the correct Python version:
python --versionThe output should show Python 3.11.x.
## Install

With the conda environment activated, install the project dependencies:

```bash
pip install --upgrade pip
pip install -r requirements.txt
```

## Dataset Preparation

Download and extract COCO 2017 train and val images:
```bash
cd dataset/coco
wget http://images.cocodataset.org/zips/train2017.zip
wget http://images.cocodataset.org/zips/val2017.zip
unzip train2017.zip
unzip val2017.zip
rm train2017.zip val2017.zip
wget http://images.cocodataset.org/annotations/annotations_trainval2017.zip
unzip annotations_trainval2017.zip
rm annotations_trainval2017.zip
```

Then set `img_folder` and `ann_file` in `coco_detection.yml` to point at the extracted images and annotations.
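The exact layout depends on the repo's configs, but in RT-DETR-style projects `coco_detection.yml` typically points the train and val dataloaders at the data roughly like this. The key names and paths below are assumptions matching the download steps above; match them against the actual file.

```yaml
# Sketch of the relevant coco_detection.yml fields (RT-DETR-style layout;
# key names are assumptions -- verify against the actual config file).
train_dataloader:
  dataset:
    img_folder: ./dataset/coco/train2017/
    ann_file: ./dataset/coco/annotations/instances_train2017.json

val_dataloader:
  dataset:
    img_folder: ./dataset/coco/val2017/
    ann_file: ./dataset/coco/annotations/instances_val2017.json
```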
## Training

Train on a single GPU:

```bash
export CUDA_VISIBLE_DEVICES=0
python tools/train.py -c configs/rtdetr/rtdetr_r50vd_6x_iit.yml
```

Fine-tune from a previously trained model:

```bash
export CUDA_VISIBLE_DEVICES=0
python tools/train.py -c configs/rtdetr/rtdetr_r50vd_6x_iit.yml -t output/rtdetr_r50vd_6x_coco/checkpoint0069.pth &> train.log 2>&1
```

Train on multiple GPUs:

```bash
export CUDA_VISIBLE_DEVICES=0,1,2,3
torchrun --nproc_per_node=4 tools/train.py -c configs/rtdetr/rtdetr_r50vd_6x_iit.yml
```

## Validation

Run evaluation only, from a checkpoint:

```bash
export CUDA_VISIBLE_DEVICES=0,1,2,3
torchrun --nproc_per_node=4 tools/train.py -c configs/rtdetr/rtdetr_r50vd_6x_iit.yml -r path/to/checkpoint --test-only
```

## Export to ONNX

```bash
python tools/export_onnx.py -c configs/rtdetr/rtdetr_r50vd_6x_iit.yml -r output/rtdetr_r50vd_6x_iit/checkpoint0071.pth --check
```

Example:
```bash
python tools/export_onnx.py -c configs/rtdetr/rtdetr_r101vd_6x_coco.yml -r output/rtdetr_r101vd_2x_coco_objects365_from_paddle.pth --check -f output/rtdetr_r101vd_coco_objects365.onnx
python tools/export_onnx.py -c configs/rtdetr/rtdetr_r50vd_6x_iit.yml -r output/rtdetr_r50vd_6x_iit/checkpoint0071.pth --check -f output/rtdetr_r50vd_6x_iit_1.onnx
```

## Inference

Run inference with an exported ONNX model:

```bash
python tools/export_onnx.py --inference --file-name output/model.onnx --image path/to/your/image.jpg
```

Example:

```bash
python tools/export_onnx.py --inference --file-name output/rtdetr_r101vd_coco_objects365.onnx --image dataset/coco/val2017/000000000139.jpg
python tools/export_onnx.py --inference --dataset iit --file-name output/iit_model01.onnx --image dataset/iit/data/VOCdevkit2012/VOC2012/JPEGImages/0.jpg
```

## Custom Data Training

- Set `remap_mscoco_category: False` in the config file
- Modify `mscoco_category2name` based on your dataset if needed
- Add `-t path/to/checkpoint` (optional) to fine-tune from a pretrained checkpoint
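For context, RT-DETR-style code keeps these category mappings as plain Python dictionaries. A replacement for a small custom label set might look like the sketch below; the category IDs and affordance names are made up for illustration and must match the `category_id` values in your own COCO-format annotation file.

```python
# Hypothetical custom replacement for mscoco_category2name -- the IDs must
# match the category ids in your COCO-format annotation file.
custom_category2name = {
    1: "grasp",
    2: "cut",
    3: "contain",
    4: "pound",
}

# Derived lookups training code typically needs: contiguous labels for the
# classification head, plus the reverse mapping for visualization.
category2label = {cat_id: i for i, cat_id in enumerate(custom_category2name)}
label2category = {label: cat_id for cat_id, label in category2label.items()}

print(category2label)  # {1: 0, 2: 1, 3: 2, 4: 3}
```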
## Additional Training/Testing Scripts

```bash
# Train on multiple GPUs
CUDA_VISIBLE_DEVICES=0,1,2,3 torchrun --nproc_per_node=4 --master-port=8989 tools/train.py -c path/to/config &> train.log 2>&1 &

# Additional options
-r path/to/checkpoint   # Resume from checkpoint
--amp                   # Use Automatic Mixed Precision
--test-only             # Run evaluation only

# Fine-tuning example
torchrun --master_port=8844 --nproc_per_node=4 tools/train.py -c configs/rtdetr/rtdetr_r18vd_6x_coco.yml -t https://github.com/lyuwenyu/storage/releases/download/v0.1/rtdetr_r18vd_5x_coco_objects365_from_paddle.pth
```

## Deactivating the Virtual Environment

Once you are done working in the virtual environment, you can deactivate it to return to your system's default Python environment.
If you are using the conda environment created above, run `conda deactivate`. If you activated a venv-style virtual environment manually, simply run:

```bash
deactivate
```

Example:

```
(affomernet_env) user@machine:~/affomernet$ deactivate
user@machine:~/affomernet$
```

A virtual environment activated via pyenv is tied to your shell session. It will automatically deactivate when you:
- Close the terminal
- Navigate away from the project directory (if using pyenv local)
If you want to stop using the virtual environment without deactivating it manually, you can remove the `.python-version` file from your project directory:

```bash
cd /path/to/your/affomernet
rm .python-version
```

This will revert to the global Python version set by pyenv.
## Troubleshooting

If you encounter issues, here are some common troubleshooting steps:

- **Lingering training processes**: Terminate any lingering training processes:

  ```bash
  ps aux | grep "tools/train.py" | awk '{print $2}' | xargs kill -9
  ```

- **Capturing training logs**: Append `&> train.log 2>&1 &` (run in the background) or `&> train.log 2>&1` (run in the foreground) to your command. Example:

  ```bash
  python tools/train.py -c configs/rtdetr/rtdetr_r50vd_6x_iit.yml &> train.log 2>&1 &
  ```

- **`deactivate` Command Not Found**: Ensure that the virtual environment is activated properly using `pyenv activate affomernet_env`
- **Persistent Activation**: Check your shell configuration files (e.g., `~/.bashrc`, `~/.zshrc`) for any lines that automatically activate a virtual environment and remove or comment them out if necessary
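If the `ps`/`grep` pipeline gets fiddly, the same lookup can be done with Python's standard library. A minimal sketch (Linux-only, since it scans `/proc`; the function name is ours, not part of the repo):

```python
# Pure-Python alternative to the ps | grep | awk pipeline: list PIDs of
# lingering tools/train.py processes by scanning /proc (Linux-only).
import os

def find_train_pids(pattern="tools/train.py"):
    pids = []
    if not os.path.isdir("/proc"):
        return pids  # non-Linux system: nothing to scan
    for entry in os.listdir("/proc"):
        if not entry.isdigit():
            continue
        try:
            with open(f"/proc/{entry}/cmdline", "rb") as f:
                cmdline = f.read().replace(b"\x00", b" ").decode(errors="replace")
        except OSError:
            continue  # process exited while we were scanning
        if pattern in cmdline:
            pids.append(int(entry))
    return pids

if __name__ == "__main__":
    for pid in find_train_pids():
        print(f"lingering trainer: {pid}")  # kill with os.kill(pid, signal.SIGKILL)
```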