tue-mps/eomt
[CVPR 2025 Highlight] Official code and models for Encoder-only Mask Transformer (EoMT).
This project offers a fast and straightforward way to analyze images and videos for segmentation tasks. It takes raw image or video files as input and outputs precise outlines and classifications for objects and regions within them. This tool is ideal for researchers, computer vision engineers, and data scientists working on tasks like medical imaging analysis, autonomous driving, or environmental monitoring.
548 stars. Actively maintained with 1 commit in the last 30 days.
Use this if you need to quickly and accurately identify and separate different objects or regions within images or video footage, especially if you're working with large pre-trained Vision Transformers.
Not ideal if your primary goal is object detection (bounding boxes) without needing detailed pixel-level segmentation, or if you prefer models with complex, task-specific decoders.
Stars
548
Forks
53
Language
Jupyter Notebook
License
MIT
Last pushed
Feb 25, 2026
Commits (30d)
1
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/computer-vision/tue-mps/eomt"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
OSUPCVLab/SegFormer3D
Official Implementation of SegFormer3D: an Efficient Transformer for 3D Medical Image...
davidiommi/Pytorch--3D-Medical-Images-Segmentation--SALMON
Segmentation deep learning ALgorithm based on MONai toolbox: single and multi-label segmentation...
htcr/sam_road
Segment Anything Model for large-scale, vectorized road network extraction from aerial imagery....
MIC-DKFZ/MedNeXt
[MICCAI 2023] MedNeXt is a fully ConvNeXt architecture for 3D medical image segmentation.
snap-research/EfficientFormer
EfficientFormerV2 [ICCV 2023] & EfficientFormer [NeurIPs 2022]