WangLibo1995/GeoSeg

UNetFormer: A UNet-like transformer for efficient semantic segmentation of remote sensing urban scene imagery, ISPRS. Also, including other vision transformers and CNNs for satellite, aerial image and UAV image segmentation.

/ 100

Emerging

# Technical Summary Implements a unified PyTorch Lightning and timm-based framework supporting hybrid architectures including state-space models (PyramidMamba), vision transformers (UNetFormer, DC-Swin), and CNNs for remote sensing segmentation across ISPRS, UAVid, LoveDA, and OpenEarthMap datasets. Provides multi-scale training/testing pipelines and inference optimization for large-scale geospatial imagery through patch-based processing with configurable stride and tile sizes. Architecture choices prioritize efficiency through encoder-decoder designs with dense connection modules (MANet, ABCNet) and feature pyramid mechanisms (A2FPN) to balance computational cost against segmentation accuracy on high-resolution aerial and satellite data.

1,046 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 22 / 25

How are scores calculated?

Stars

1,046

Forks

150

Language

Python

License

GPL-3.0

Related tools

cvjena/cn24

Convolutional (Patch) Networks for Semantic Segmentation

TUI-NICR/EMSANet

EMSANet: Efficient Multi-Task RGB-D Scene Analysis for Indoor Environments

qizhuli/Weakly-Supervised-Panoptic-Segmentation

Weakly- and Semi-Supervised Panoptic Segmentation (ECCV18)

nmhaddad/semantic-segmentation

Off-Road Perception with DeepLabV3+

liuziwei7/region-conv

Not All Pixels Are Equal: Difficulty-Aware Semantic Segmentation via Deep Layer Cascade

Explore Computer Vision Tools

All categories Trending Computer Vision directory Insights