WangLibo1995/GeoSeg
UNetFormer: A UNet-like transformer for efficient semantic segmentation of remote sensing urban scene imagery, ISPRS. Also, including other vision transformers and CNNs for satellite, aerial image and UAV image segmentation.
# Technical Summary Implements a unified PyTorch Lightning and timm-based framework supporting hybrid architectures including state-space models (PyramidMamba), vision transformers (UNetFormer, DC-Swin), and CNNs for remote sensing segmentation across ISPRS, UAVid, LoveDA, and OpenEarthMap datasets. Provides multi-scale training/testing pipelines and inference optimization for large-scale geospatial imagery through patch-based processing with configurable stride and tile sizes. Architecture choices prioritize efficiency through encoder-decoder designs with dense connection modules (MANet, ABCNet) and feature pyramid mechanisms (A2FPN) to balance computational cost against segmentation accuracy on high-resolution aerial and satellite data.
1,046 stars. No commits in the last 6 months.
Stars
1,046
Forks
150
Language
Python
License
GPL-3.0
Category
Last pushed
Aug 19, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/computer-vision/WangLibo1995/GeoSeg"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
cvjena/cn24
Convolutional (Patch) Networks for Semantic Segmentation
TUI-NICR/EMSANet
EMSANet: Efficient Multi-Task RGB-D Scene Analysis for Indoor Environments
qizhuli/Weakly-Supervised-Panoptic-Segmentation
Weakly- and Semi-Supervised Panoptic Segmentation (ECCV18)
nmhaddad/semantic-segmentation
Off-Road Perception with DeepLabV3+
liuziwei7/region-conv
Not All Pixels Are Equal: Difficulty-Aware Semantic Segmentation via Deep Layer Cascade