tue-mps/eomt

[CVPR 2025 Highlight] Official code and models for Encoder-only Mask Transformer (EoMT).

/ 100

Established

This project offers a fast and straightforward way to analyze images and videos for segmentation tasks. It takes raw image or video files as input and outputs precise outlines and classifications for objects and regions within them. This tool is ideal for researchers, computer vision engineers, and data scientists working on tasks like medical imaging analysis, autonomous driving, or environmental monitoring.

548 stars. Actively maintained with 1 commit in the last 30 days.

Use this if you need to quickly and accurately identify and separate different objects or regions within images or video footage, especially if you're working with large pre-trained Vision Transformers.

Not ideal if your primary goal is object detection (bounding boxes) without needing detailed pixel-level segmentation, or if you prefer models with complex, task-specific decoders.

image-segmentation video-analysis computer-vision pattern-recognition medical-imaging

No Package No Dependents

Maintenance 13 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 17 / 25

How are scores calculated?

Stars

548

Forks

Language

Jupyter Notebook

License

MIT

Related tools

OSUPCVLab/SegFormer3D

Official Implementation of SegFormer3D: an Efficient Transformer for 3D Medical Image...

davidiommi/Pytorch--3D-Medical-Images-Segmentation--SALMON

Segmentation deep learning ALgorithm based on MONai toolbox: single and multi-label segmentation...

htcr/sam_road

Segment Anything Model for large-scale, vectorized road network extraction from aerial imagery....

MIC-DKFZ/MedNeXt

[MICCAI 2023] MedNeXt is a fully ConvNeXt architecture for 3D medical image segmentation.

snap-research/EfficientFormer

EfficientFormerV2 [ICCV 2023] & EfficientFormer [NeurIPs 2022]

Explore Computer Vision Tools

All categories Trending Computer Vision directory Insights