Zelili AI

Yume 1.5

A text-controlled interactive world generation model.
Developers: Xiaofeng Mao, Kaipeng Zhang, and team (Shanghai AI Laboratory / Fudan University)
Tool Release Date: Dec 2025
Tool Users: 15K+
Starting Price: $0/Month

About This AI

Yume 1.5 is an advanced “World Model” that functions like a playable video game generated entirely by AI in real time.

Unlike standard video generators, which produce a passive clip, Yume 1.5 generates an interactive, consistent, 3D-like environment that users can explore with keyboard controls (WASD) and modify with text commands.

It uses a novel architecture called “Joint Temporal Spatial Channel Modeling” (TSCM) to compress historical frames and retain long-range context, so that if you turn around in the virtual world, the scenery stays consistent with what was there before.
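
As a rough illustration of why compressing history lets the stored context grow far more slowly than the raw frame count, the sketch below keeps the newest frames intact and stores older frames at progressively coarser detail. This is a toy stand-in, not the TSCM method itself: the names `downsample` and `compress_history`, the `recent` window, and the doubling schedule are all assumptions made for illustration.

```python
def downsample(frame, factor):
    """Keep every `factor`-th value of a flat frame (toy spatial compression)."""
    return frame[::factor]


def compress_history(frames, recent=2, max_factor=8):
    """Return a compressed copy of `frames` (oldest first).

    The last `recent` frames are kept intact; older frames are downsampled
    more aggressively the further back they are, capped at `max_factor`.
    Hypothetical schedule, chosen only to illustrate the idea.
    """
    compressed = []
    n = len(frames)
    for i, frame in enumerate(frames):
        age = n - 1 - i  # 0 for the newest frame
        if age < recent:
            compressed.append(list(frame))
        else:
            factor = min(2 ** (age - recent + 1), max_factor)
            compressed.append(downsample(frame, factor))
    return compressed


# Six toy "frames" of 16 values each, oldest first.
history = [list(range(16)) for _ in range(6)]
sizes = [len(f) for f in compress_history(history)]
print(sizes)  # → [2, 2, 4, 8, 16, 16]
```

In this toy version the six frames occupy 48 values instead of 96, and the oldest frames cost the least. How the real model decides what to keep is not described in this listing.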

Key Features

  1. Interactive Exploration: Users navigate the generated video world in real time with keyboard inputs (WASD for movement), much like a first-person game.
  2. Text-Controlled Events: Users can type commands to trigger specific events within the world (e.g., "it starts raining," "a cat runs across the street").
  3. Infinite Context: Uses TSCM to compress historical frames, allowing theoretically unbounded video generation without the "amnesia" common in older models.
  4. Real-Time Speed: Optimized with bidirectional attention distillation to run at playable frame rates (approx. 12 fps on a single A100 GPU).
  5. Single-Image Start: Can build an entire dynamic world from one uploaded image or a simple text prompt.
  6. Dual-Stream Attention: Separates action processing from visual rendering so that control inputs do not distort image quality.
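
To make the two input channels above concrete (movement keys and typed events), here is a minimal sketch of how a front end might merge them into one control signal per frame. It is not taken from the Yume 1.5 code: `KEY_TO_MOVE`, `ControlSignal`, and `build_control` are hypothetical names, and the real model's input format may differ.

```python
from dataclasses import dataclass, field

# Hypothetical key bindings: (forward, strafe) deltas per held key.
KEY_TO_MOVE = {
    "w": (1, 0),
    "s": (-1, 0),
    "a": (0, -1),
    "d": (0, 1),
}


@dataclass
class ControlSignal:
    forward: int = 0
    strafe: int = 0
    events: list = field(default_factory=list)


def build_control(keys_pressed, text_command=None):
    """Combine held keys and an optional text command into one signal."""
    signal = ControlSignal()
    for key in keys_pressed:
        dx, dy = KEY_TO_MOVE.get(key.lower(), (0, 0))  # ignore unknown keys
        signal.forward += dx
        signal.strafe += dy
    if text_command and text_command.strip():
        signal.events.append(text_command.strip())
    return signal


signal = build_control(["W", "d"], "it starts raining")
print(signal)
```

Keeping movement and text events in separate fields mirrors the idea of handling actions apart from visual content, though only in the loosest sense.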

Pros

  1. The first open-source model to combine high-quality video generation with game-like interactivity.
  2. Handles long-video consistency better than standard diffusion models.
  3. Completely free and open source (Apache 2.0).
  4. Highly responsive to complex text instructions for environmental changes.

Cons

  1. Requires high-end GPU hardware (e.g., A100 or H100) for smooth real-time performance.
  2. Resolution is currently limited (often 720p) compared to non-interactive renderers.
  3. No user-friendly app for non-coders (requires Python/CLI setup).
  4. As a research demo, physics interactions are visual only (no true collision detection engine).

Best for: AI researchers, game developers, and simulation engineers looking to experiment with “playable” generative video and neural game engines.

FAQs

  • Is Yume 1.5 a game engine?

    In a way, yes. It is a “neural game engine.” Instead of rendering polygons like Unity or Unreal, it “dreams” the next frame of the video based on your inputs, allowing open-ended, open-world exploration without 3D assets.

  • Is Yume 1.5 free?

    Yes, the code and model weights (Yume 5B 720P) are open source and available for free on GitHub and Hugging Face.

  • Can I run Yume 1.5 on my PC?

    It is very demanding. The code is available, but running it at playable speeds currently requires enterprise-grade GPUs (such as an NVIDIA A100). “Lite” versions for consumer cards may be developed by the community.

  • Who created Yume 1.5?

    It was developed by researchers at the Shanghai AI Laboratory and Fudan University, led by Xiaofeng Mao and Kaipeng Zhang.
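
The “dreams the next frame based on your inputs” idea from the first FAQ answer can be sketched as a simple loop in which each new frame depends on the frames so far plus the latest input. The code is a toy: `rollout`, `toy_step`, and `frame_budget_ms` are hypothetical names, and `toy_step` merely stands in for the actual model. The last line works out the per-frame time budget implied by the approx. 12 fps figure quoted in the feature list.

```python
def frame_budget_ms(fps):
    """Time available to produce one frame at a given frame rate."""
    return 1000.0 / fps


def rollout(step_fn, first_frame, inputs):
    """Generate one new frame per user input, each conditioned on all
    frames so far. `step_fn` is a hypothetical stand-in for the model."""
    frames = [first_frame]
    for user_input in inputs:
        frames.append(step_fn(frames, user_input))
    return frames


def toy_step(frames, user_input):
    # Placeholder "model": records which input produced which frame.
    return f"frame{len(frames)}<-{user_input}"


frames = rollout(toy_step, "frame0", ["w", "w", "a"])
print(frames)  # → ['frame0', 'frame1<-w', 'frame2<-w', 'frame3<-a']
print(f"{frame_budget_ms(12):.1f} ms per frame at 12 fps")  # → 83.3 ms per frame at 12 fps
```

At 12 fps the generator has roughly 83 ms to produce each frame, which helps explain why smooth play currently calls for data-center GPUs.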
