Jiannan Huang

Rbrq
Ā·

AI & ML interests

None yet

Recent Activity

liked a model 8 days ago
black-forest-labs/FLUX.1-dev
liked a Space 2 months ago
amirgame197/Image-to-Drawing
liked a model 4 months ago
Qwen/Qwen2-VL-2B-Instruct
View all activity

Organizations

None yet

Rbrq's activity

liked a Space 2 months ago
upvoted an article 5 months ago
reacted to merve's post with šŸ”„ 6 months ago
view post
Post
5085
Real-time DEtection Transformer (RT-DETR) landed in transformers šŸ¤© with Apache 2.0 license šŸ˜

šŸ”– models: https://huggingface.co/PekingU
šŸ”– demo: merve/RT-DETR-tracking-coco
šŸ“ paper: DETRs Beat YOLOs on Real-time Object Detection (2304.08069)
šŸ“– notebook: https://github.com/merveenoyan/example_notebooks/blob/main/RT_DETR_Notebook.ipynb

YOLO models are known to be super fast for real-time computer vision, but they have a downside with being volatile to NMS šŸ„²

Transformer-based models on the other hand are computationally not as efficient šŸ„²

Isn't there something in between? Enter RT-DETR!

The authors combined CNN backbone, multi-stage hybrid decoder (combining convs and attn) with a transformer decoder. In the paper, authors also claim one can adjust speed by changing decoder layers without retraining altogether.
The authors find out that the model performs better in terms of speed and accuracy compared to the previous state-of-the-art. šŸ¤©
liked a Space about 1 year ago
updated a model about 1 year ago