Flow Matching · Multi-Modal Input · Task-Specific Decoders · A100 40GB
| Dataset | Size | Content | Used by |
|---|---|---|---|
| Objaverse-XL | 10M+ | Massive diverse 3D objects | AtlasForgeCastLens |
| Objaverse | 800K+ | Diverse annotated 3D assets | ForgeCastLens |
| ShapeNet | 55K | Common object categories | ForgeCast |
| ScanNet / ScanNet++ | 1.5K scenes | Indoor 3D scans (RGB-D) | AtlasLens |
| KITTI / nuScenes | 40K frames | Outdoor driving 3D scenes | AtlasLens |
| ABO (Amazon Berkeley) | 148K | Product meshes + materials | Forge |
| Thingiverse | 2M+ | 3D printable STL models | Cast |
| Polycam Scans | ~500K | Real-world 3DGS / NeRF captures | Lens |
| Synthetic Renders | Generated | Multi-view renders of Objaverse | AtlasForgeCastLens |
| Text-3D Pairs (synthetic) | Generated | GPT-4o descriptions of Objaverse | AtlasForgeCastLens |