All Computer Vision Tools
2,349 tools ranked by quality score · Page 5 of 24
| # | Tool | Score | Tier |
|---|---|---|---|
| 401 | geekquad/Pixel-Processing: 📷 This repository is focused on having various feature implementation of... | | Established |
| 402 | lightly-ai/labelformat: A tool for converting computer vision label formats. | | Established |
| 403 | OSU-NLP-Group/UGround: [ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents | | Emerging |
| 404 | jmanhype/vggt-mps: VGGT 3D Vision Agent optimized for Apple Silicon with Metal Performance Shaders | | Emerging |
| 405 | andyzeng/tsdf-fusion-python: Python code to fuse multiple RGB-D images into a TSDF voxel volume. | | Emerging |
| 406 | Unity-Technologies/Robotics-Object-Pose-Estimation: A complete end-to-end demonstration in which we collect training data in... | | Emerging |
| 407 | SkalskiP/make-sense: Free to use online tool for labelling photos. https://makesense.ai | | Emerging |
| 408 | Zeyi-Lin/HivisionIDPhotos: ⚡️HivisionIDPhotos: a lightweight and efficient AI ID photo tool. | | Emerging |
| 409 | Fabric-Project/Fabric: Node Creative Coding / 3D / Image Processing tool inspired by Quartz Composer | | Emerging |
| 410 | cuixing158/Awesome-CV-MasterHub: :fire: :fire: :fire: A paper list of some recent Computer Vision (CV) works | | Emerging |
| 411 | zc-alexfan/arctic: [CVPR 2023] Official repository for downloading, processing, visualizing,... | | Emerging |
| 412 | 1adrianb/binary-human-pose-estimation: This code implements a demo of the Binarized Convolutional Landmark... | | Emerging |
| 413 | ahrs365/navsim-local: A trajectory optimization simulator that loads JSON maps, simulates dynamic obstacles, and supports different planner plugins; an ALM-LBFGS optimization planner plugin is configured by default. Intended for use with the online scene editor. | | Emerging |
| 414 | last-one/Pytorch_Realtime_Multi-Person_Pose_Estimation: Pytorch version of Realtime Multi-Person Pose Estimation project | | Emerging |
| 415 | neeru1207/AI_Sudoku: GUI based Smart Sudoku Solver that tries to extract a sudoku puzzle from a... | | Emerging |
| 416 | andyzeng/tsdf-fusion: Fuse multiple depth frames into a TSDF voxel volume. | | Emerging |
| 417 | memoakten/webcam-pix2pix-tensorflow: Source code and pretrained model for running pix2pix in realtime on a webcam feed. | | Emerging |
| 418 | foo123/FILTER.js: Video and Image Processing and Computer Vision Library for JavaScript... | | Emerging |
| 419 | lucasjinreal/yolov7_d2: 🔥🔥🔥🔥 (Earlier YOLOv7 not official one) YOLO with Transformers and Instance... | | Emerging |
| 420 | iwatake2222/self-driving-ish_computer_vision_system: This project generates images you've probably seen in autonomous driving... | | Emerging |
| 421 | col14m/cadrille: [ICLR2026] cadrille: Multi-modal CAD Reconstruction with Online... | | Emerging |
| 422 | xuebinqin/U-2-Net: The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net:... | | Emerging |
| 423 | FateScript/CenterNet-better: An easy to understand and better performance version of CenterNet | | Emerging |
| 424 | zaina-ml/ml_forge: A visual-based graph node editor for training computer vision models. | | Emerging |
| 425 | LdDl/rust-road-traffic: Vehicle counting/tracking and speed estimation | | Emerging |
| 426 | google/stereo-magnification: Code accompanying the SIGGRAPH 2018 paper "Stereo Magnification: Learning... | | Emerging |
| 427 | ChirikjianLab/Marching-Primitives: [CVPR2023 Highlight] Marching-Primitives: Shape Abstraction from Signed... | | Emerging |
| 428 | uber-research/UPSNet: UPSNet: A Unified Panoptic Segmentation Network | | Emerging |
| 429 | adityamwagh/SuperSLAM: SuperSLAM: Open Source Framework for Deep Learning based Visual SLAM (Work... | | Emerging |
| 430 | TIBHannover/GeoEstimation: This repository contains all necessary meta information, results and source... | | Emerging |
| 431 | noshluk2/ROS2-Self-Driving-Car-AI-using-OpenCV: ROS2 Self Driving Car using Deep Learning and Object Tracking through OpenCV | | Emerging |
| 432 | mit-han-lab/spvnas: [ECCV 2020] Searching Efficient 3D Architectures with Sparse Point-Voxel Convolution | | Emerging |
| 433 | mesutpiskin/opencv-object-detection: :camera: Object detection with OpenCV on Java. DNN, HaarCascade, Template... | | Emerging |
| 434 | augmentedstartups/AS-One: Easy & Modular Computer Vision Detectors, Trackers & SAM - Run... | | Emerging |
| 435 | facemoji/mocap4face: Cross-platform SDK for facial motion capture producing blendshapes and rigid... | | Emerging |
| 436 | fundamentalvision/BEVFormer: [ECCV 2022] This is the official implementation of BEVFormer, a camera-only... | | Emerging |
| 437 | stefanopini/simple-HRNet: Multi-person Human Pose Estimation with HRNet in Pytorch | | Emerging |
| 438 | Shank2358/GGHL: This is the implementation of GGHL (A General Gaussian Heatmap Label... | | Emerging |
| 439 | mats-robotics/yolov5_ros: A complete ROS interface for running YOLOv5 inference | | Emerging |
| 440 | Uason-Chen/CTR-GCN: [ICCV2021] Official code for "Channel-wise Topology Refinement Graph... | | Emerging |
| 441 | eBay/modanet: ModaNet: A large-scale street fashion dataset with polygon annotations | | Emerging |
| 442 | jiwoon-ahn/irn: Weakly Supervised Learning of Instance Segmentation with Inter-pixel... | | Emerging |
| 443 | ultralytics/xview-yolov3: xView 2018 Object Detection Challenge: YOLOv3 Training and Inference. | | Emerging |
| 444 | Charmve/Surface-Defect-Detection: 📈 The largest collection of industrial defect detection datasets and papers to date. Constantly summarizing open source dataset and critical... | | Emerging |
| 445 | CBICA/CaPTk: Cancer Imaging Phenomics Toolkit (CaPTk) is a software platform to perform... | | Emerging |
| 446 | NMGRL/pychron: Data acquisition and processing framework for Ar-Ar geochronology and noble... | | Emerging |
| 447 | CalciferZh/minimal-hand: A minimal solution to hand motion capture from a single color camera at over... | | Emerging |
| 448 | lvchuandong/Awesome-Multi-Camera-3D-Occupancy-Prediction: Awesome papers and code about Multi-Camera 3D Occupancy Prediction, such as... | | Emerging |
| 449 | realsenseai/hand_tracking_samples: :wave: :ok_hand: research codebase for depth-based hand pose estimation... | | Emerging |
| 450 | mahmoodlab/HIPT: Hierarchical Image Pyramid Transformer - CVPR 2022 (Oral) | | Emerging |
| 451 | dog-qiuqiu/MobileNet-Yolo: MobileNetV2-YoloV3-Nano: 0.5BFlops 3MB HUAWEI P40: 6ms/img,... | | Emerging |
| 452 | StanfordVL/GibsonEnv: Gibson Environments: Real-World Perception for Embodied Agents | | Emerging |
| 453 | hijam-git/Porda-AI: The World's First Realtime onDevice AI Project to maintain Modesty and... | | Emerging |
| 454 | line/lighthouse: [EMNLP2024 Demo], [ICASSP 2025], [ICASSP 2026] A user-friendly library for... | | Emerging |
| 455 | lim-anggun/FgSegNet: FgSegNet: Foreground Segmentation Network, Foreground Segmentation Using... | | Emerging |
| 456 | alibaba/EasyCV: An all-in-one toolkit for computer vision | | Emerging |
| 457 | ambakick/Person-Detection-and-Tracking: A tensorflow implementation with SSD model for person detection and Kalman... | | Emerging |
| 458 | trekhleb/links-detector: 📖 👆🏻 Links Detector makes printed links clickable via your smartphone... | | Emerging |
| 459 | HusseinYoussef/Arabic-OCR: OCR system for Arabic language that converts images of typed text to... | | Emerging |
| 460 | isarsoft/yolov4-triton-tensorrt: This repository deploys YOLOv4 as an optimized TensorRT engine to Triton... | | Emerging |
| 461 | e-candeloro/Driver-State-Detection: A real time, webcam based, driver attention state detection/monitoring... | | Emerging |
| 462 | riccardocadei/photovoltaic-detection: Detecting available rooftop area from satellite images to install... | | Emerging |
| 463 | nsavinov/semantic3dnet: Point cloud semantic segmentation via Deep 3D Convolutional Neural Network | | Emerging |
| 464 | Srameo/LED: [ICCV 2023] Lighting Every Darkness in Two Pairs: A Calibration-Free... | | Emerging |
| 465 | PeterL1n/BackgroundMattingV2: Real-Time High-Resolution Background Matting | | Emerging |
| 466 | pixano/pixano-inference: Inference models for Pixano | | Emerging |
| 467 | sniklaus/3d-ken-burns: an implementation of 3D Ken Burns Effect from a Single Image using PyTorch | | Emerging |
| 468 | louisejuliedelhaye/SANDI: Sediment ANalysis and Delineation through Images. A free and open-source... | | Emerging |
| 469 | OAID/TengineKit: TengineKit - Free, Fast, Easy, Real-Time Face Detection & Face Landmarks &... | | Emerging |
| 470 | PRBonn/LiDAR-MOS: (LMNet) Moving Object Segmentation in 3D LiDAR Data: A Learning-based... | | Emerging |
| 471 | FangjinhuaWang/PatchmatchNet: Official code of PatchmatchNet (CVPR 2021 Oral) | | Emerging |
| 472 | ibaiGorordo/ONNX-YOLOv7-Object-Detection: Python scripts performing object detection using the YOLOv7 model in ONNX. | | Emerging |
| 473 | cvg/nice-slam: [CVPR'22] NICE-SLAM: Neural Implicit Scalable Encoding for SLAM | | Emerging |
| 474 | wellflat/imageprocessing-labs: computer vision, image processing and machine learning on the web browser or node. | | Emerging |
| 475 | ChibaniMohamed/Polaris: Face recognition attendance system. | | Emerging |
| 476 | DoubangoTelecom/ultimateMRZ-SDK: Machine-readable zone/travel document (MRZ / MRTD) detector and recognizer... | | Emerging |
| 477 | Imageomics/Prompt_CAM: This is an official implementation for PROMPT-CAM: A Simpler Interpretable... | | Emerging |
| 478 | zc-alexfan/hold: [CVPR 2024✨Highlight] Official repository for HOLD, the first method that... | | Emerging |
| 479 | wkentaro/morefusion: MoreFusion: Multi-object Reasoning for 6D Pose Estimation from Volumetric... | | Emerging |
| 480 | Yochengliu/Relation-Shape-CNN: Relation-Shape Convolutional Neural Network for Point Cloud Analysis (CVPR... | | Emerging |
| 481 | ceciliavision/perceptual-reflection-removal: Single Image Reflection Separation with Perceptual Losses | | Emerging |
| 482 | raviksharma/blurfaces: blurs faces in video | | Emerging |
| 483 | mks0601/V2V-PoseNet_RELEASE: Official Torch7 implementation of "V2V-PoseNet: Voxel-to-Voxel Prediction... | | Emerging |
| 484 | jabberjabberjabber/ImageIndexer: Creates an index of images, queries a local LLM and adds tags to the image metadata | | Emerging |
| 485 | bit-bots/imagetagger: An open source online platform for collaborative image labeling | | Emerging |
| 486 | pathak22/unsupervised-video: [CVPR 2017] Unsupervised deep learning using unlabelled videos on the web | | Emerging |
| 487 | zamhown/wear-a-mask: 😷 An SPA that uses only the front-end to perform deep-learning-based facial... | | Emerging |
| 488 | Joao-M-Silva/padel_analytics: AI-powered padel analytics | | Emerging |
| 489 | lyxok1/Tiny-DSOD: Tiny-DSOD: Lightweight Object Detection for Resource-Restricted Usage | | Emerging |
| 490 | michalfaber/tensorflow_Realtime_Multi-Person_Pose_Estimation: Multi-Person Pose Estimation project for Tensorflow 2.0 with a small and... | | Emerging |
| 491 | GewelsJI/SINet-V2: Concealed Object Detection (SINet-V2, IEEE TPAMI 2022). Code is implemented... | | Emerging |
| 492 | MaryamBoneh/Vehicle-Detection: Vehicle Detection Using Deep Learning and YOLO Algorithm | | Emerging |
| 493 | filaPro/cad-recode: [ICCV2025] CAD-Recode: Reverse Engineering CAD Code from Point Clouds | | Emerging |
| 494 | visualbuffer/copilot: Lane and obstacle detection for active assistance during driving. Uses... | | Emerging |
| 495 | mkocabas/EpipolarPose: Self-Supervised Learning of 3D Human Pose using Multi-view Geometry (CVPR2019) | | Emerging |
| 496 | jkflying/opencalibration: A fast, scalable and deterministic camera calibration library for aerial... | | Emerging |
| 497 | Running-Turtle1/jittor-retinanet: A Jittor implementation of the RetinaNet. | | Emerging |
| 498 | HsiehYiChia/Scene-text-recognition: Scene text detection and recognition based on Extremal Region (ER) | | Emerging |
| 499 | KevinLTT/video2bvh: Extracts human motion from video and saves it as a BVH mocap file. | | Emerging |
| 500 | RG-O/YoutubeOverCommercials: Browser extension that automatically blocks TV commercials and plays YouTube... | | Emerging |