Trellis 2

State-of-the-Art Open-Source Image-to-3D Generation – High-Fidelity Textured Assets with Native Structured Latents
Last Updated: December 31, 2025
By Zelili AI

About This AI

Trellis 2 is Microsoft’s open-source large 3D generative model with 4 billion parameters, specializing in high-fidelity image-to-3D generation.

It transforms a single 2D image into fully textured 3D assets with complex topologies, sharp features, and complete PBR (Physically Based Rendering) materials including base color, roughness, metallic, and opacity.

The model introduces a novel ‘field-free’ sparse voxel structure called O-Voxel that encodes both geometry and appearance natively, avoiding limitations of traditional signed distance fields.

Powered by a Sparse Compression VAE with 16x spatial downsampling and flow-matching transformers, it achieves exceptional efficiency and quality, producing assets up to 1536 cubed resolution.

Generation times are fast on high-end GPUs (e.g., 3 seconds for 512 cubed, 17 seconds for 1024 cubed, 60 seconds for 1536 cubed on H100).

Key strengths include handling arbitrary topologies (open surfaces, non-manifold geometry, interiors), rich texturing, and minimalist processing pipelines for fast mesh-to-latent and latent-to-mesh conversions.

Released in December 2025 under MIT license with full code, weights on Hugging Face, and GitHub repo, it supports academic and research use only.

Demos available on Hugging Face Spaces allow testing image-to-3D conversion, showcasing diverse assets like characters, objects, and scenes with production-ready detail.

Ideal for 3D artists, game developers, VFX professionals, researchers, and creators seeking efficient, high-quality 3D asset generation without proprietary tools.

Key Features

  1. High-fidelity image-to-3D generation: Converts single 2D images into detailed textured 3D models with PBR materials
  2. O-Voxel structured latents: Novel field-free sparse voxel representation for geometry and appearance encoding
  3. Sparse Compression VAE: 16x spatial downsampling for compact latent space and efficient processing
  4. Arbitrary topology support: Handles complex shapes, open surfaces, interiors, and non-manifold geometry
  5. Full PBR texturing: Generates base color, roughness, metallic, and opacity for realistic relighting
  6. Fast mesh-latent conversion: Rendering-free bidirectional processing (mesh to O-Voxel smaller than 10s CPU, O-Voxel to mesh less than 100ms CUDA)
  7. High-resolution output: Supports up to 1536 cubed voxel resolution for detailed assets
  8. Efficient generation pipeline: Flow-matching transformers for scalable, high-quality results
  9. Open-source accessibility: MIT license with code, weights, and demos on GitHub/Hugging Face
  10. Robust to diverse inputs: Works on real photos, renders, and varied object categories

Price Plans

  1. Free ($0): Fully open-source under MIT license with model weights, code, and demos available on GitHub and Hugging Face for research/academic use; no fees or subscriptions
  2. Commercial (Not Available): Explicitly restricted to non-commercial research; no paid tiers or enterprise options

Pros

  1. Exceptional quality and detail: Produces production-ready textured 3D assets with sharp features and complex structures
  2. High efficiency: Fast generation times and compact latents make it practical on high-end hardware
  3. Topology flexibility: Breaks traditional limitations by handling arbitrary shapes and open geometry
  4. Rich PBR materials: Full physically based rendering attributes for realistic lighting and appearance
  5. Fully open-source: MIT license enables research, reproduction, and community extensions
  6. Minimalist pipeline: No rendering or heavy optimization needed for conversions
  7. Strong benchmark performance: Outperforms many prior open and some proprietary image-to-3D models

Cons

  1. Research-only license: Explicitly for academic/research use; no commercial deployment allowed
  2. High hardware requirements: Needs powerful GPUs (e.g., H100 for optimal speeds) for high-res generation
  3. Generation time scales with resolution: 60 seconds for max 1536 cubed on top hardware
  4. Setup and inference complexity: Requires technical expertise for local running via GitHub code
  5. No hosted easy interface: Demos on Hugging Face but full power needs custom setup
  6. Potential inconsistencies: May require prompt tuning for perfect results on challenging inputs
  7. Limited adoption metrics: Recent release with growing but unquantified community use

Use Cases

  1. 3D asset creation for games/VFX: Generate detailed models from concept images quickly
  2. Research in 3D generation: Experiment with structured latents and topology handling
  3. Product visualization: Convert product photos to rotatable 3D models with textures
  4. Digital prototyping: Rapidly create 3D versions of sketches or renders
  5. AR/VR content development: Produce textured assets for immersive environments
  6. Dataset augmentation: Generate synthetic 3D variations from real images
  7. Educational tools: Demonstrate advanced 3D AI techniques in courses

Target Audience

  1. 3D artists and VFX professionals: Seeking fast high-quality asset generation
  2. Game developers: Prototyping models without manual sculpting
  3. AI/3D researchers: Studying or extending state-of-the-art generation models
  4. Digital creators and designers: Converting 2D concepts to 3D quickly
  5. Academic institutions: Using for computer vision/graphics research
  6. Tech enthusiasts: Experimenting with open-source 3D AI locally

How To Use

  1. Visit GitHub: Go to github.com/microsoft/TRELLIS.2 for code, docs, and setup instructions
  2. Download model: Get weights from Hugging Face (microsoft/TRELLIS.2-4B)
  3. Install environment: Set up Python with required dependencies (PyTorch, etc.) per repo guide
  4. Run inference: Use provided scripts or example.py to load model and generate from image input
  5. Provide input: Upload a single image or use prompt for generation
  6. Process output: Receive textured 3D mesh in formats like .obj/.glb with PBR maps
  7. Try demo: Use Hugging Face Space for quick no-setup testing

How we rated Trellis 2

  • Performance: 4.8/5
  • Accuracy: 4.7/5
  • Features: 4.6/5
  • Cost-Efficiency: 5.0/5
  • Ease of Use: 4.3/5
  • Customization: 4.5/5
  • Data Privacy: 5.0/5
  • Support: 4.2/5
  • Integration: 4.4/5
  • Overall Score: 4.7/5

Trellis 2 integration with other tools

  1. Hugging Face: Model weights and demo spaces for easy testing and inference
  2. GitHub Repository: Full code, training/inference scripts, and community issues/discussions
  3. 3D Software (Potential): Outputs compatible with Blender, Unity, Unreal Engine via standard mesh formats
  4. ComfyUI / Automatic1111: Community workflows for integration into popular AI image/3D pipelines
  5. Local GPU Setup: Runs natively on CUDA-enabled hardware; no cloud required for core use

Best prompts optimised for Trellis 2

  1. A highly detailed futuristic robot with metallic surfaces and glowing accents, front view photo on white background, generate full PBR textured 3D model
  2. Ancient stone statue of a warrior in dynamic pose, dramatic lighting, archaeological photo style, produce high-resolution 3D asset with realistic materials
  3. Cute cartoon fox character standing alert, vibrant fur colors, game-ready style reference image, convert to textured 3D model with sharp details
  4. Realistic sports car in red, side profile studio shot with reflections, automotive photography, create production-quality 3D model with PBR textures
  5. Fantasy dragon perched on rock, scales and wings detailed, epic lighting, concept art photo, generate complex topology 3D asset with full materials
Trellis 2 delivers outstanding open-source image-to-3D generation with high-fidelity PBR textures, complex topology handling, and efficient processing that rivals proprietary tools. Its O-Voxel innovation enables sharp, detailed assets quickly. Ideal for research and prototyping; hardware demands and research-only license limit broader use, but it’s a major advancement for accessible 3D AI.

FAQs

  • What is Trellis 2?

    Trellis 2 is Microsoft’s open-source 4B-parameter image-to-3D generative model that creates high-fidelity textured 3D assets with PBR materials from single images.

  • When was Trellis 2 released?

    Trellis 2 was released on December 16, 2025, with code, weights, and paper made public for research use.

  • Is Trellis 2 free to use?

    Yes, it is completely open-source under MIT license for academic and research purposes, with full access to model weights and code on GitHub/Hugging Face.

  • What are the key innovations in Trellis 2?

    It introduces O-Voxel sparse voxel structure for field-free geometry and appearance encoding, plus Sparse Compression VAE with 16x downsampling for efficient high-res generation.

  • What resolutions does Trellis 2 support?

    It generates assets up to 1536 cubed voxels, with fast times like 3s for 512 cubed and 60s for max res on H100 GPUs.

  • Can Trellis 2 be used commercially?

    No, it is explicitly for research and academic use only; commercial applications are not permitted under the release terms.

  • How does Trellis 2 handle complex shapes?

    The O-Voxel representation supports arbitrary topologies including open surfaces, interiors, and non-manifold geometry without traditional field limitations.

  • Where can I try Trellis 2?

    Try the demo on Hugging Face Spaces (microsoft/TRELLIS.2) or run locally via the GitHub repo after setup.

Newly Added Tools​

Qwen-Image-2.0

$0/Month

Qodo AI

$0/Month

Codiga

$10/Month

Tabnine

$59/Month
Trellis 2 Alternatives

Qwen-Image-2.0

$0/Month

GLM-OCR

$0/Month

Lummi AI

$10/Month

About Author

Hi Guys! We are a group of ML Engineers by profession with years of experience exploring and building AI tools, LLMs, and generative technologies. We analyze new tools not just as a user, but as someone who understands their technical depth and real-world value.We know how overwhelming these tools can be for most people, that’s why we break down complex AI concepts into simple, practical insights. Our goal is to help you discover these magical AI tools that actually save your time and make everyday work smarter, not harder.“We don’t just write about AI: We build, test and simplify it for you.”