A breakthrough deep-learning model tracks 5,000 fruit fly cells with 90% accuracy, paving the way for early disease detection in human tissues.
A "Dual-Graph" Innovation: MIT...
A new 3B-parameter model uses a novel "analyze-then-parse" approach to master complex layouts with pixel-level precision.
Universal Understanding: Dolphin-v2 is a lightweight (3B-parameter) model...
Unlocking the power of immersive storytelling, robotics, and AR by synthesizing realistic egocentric perspectives from standard footage.
Immersive Transformation: EgoX is a groundbreaking framework that generates...
How a simple prompting trick called Verbalized Sampling overcomes the "Typicality Bias" that makes LLMs predictable.
The Root Cause: Research identifies "Typicality Bias"—a cognitive psychological tendency...
Introducing a new era of open-source multimodal models featuring native tool use, massive context windows, and real-world agentic capabilities.
A Dual-Model Release: The GLM-4.6V series launches...
Bridging the gap between precision and flexibility with a novel "Chain-of-Frames" approach.
The Precision Paradox: Current video editing AI faces a critical trade-off between expert models...
Moving beyond handcrafted graphics to real-time, text-guided world building.
Bridging the AR Gap: EgoEdit addresses the unique challenges of first-person (egocentric) footage—such as rapid motion and...
ByteDance’s new model redefines video creation with unprecedented spatio-temporal precision.
Beyond Simple Editing: Vidi2 can ingest hours of raw footage and a simple prompt to autonomously...
Revolutionizing video generation by solving the "first-frame" problem and harmonizing motion with identity.
Paradigm Shift: SteadyDancer moves away from the flawed Reference-to-Video (R2V) model to an...
Bridging Words and Visions to Create Smarter, More Adaptive Agents in a Complex World
Unified Prediction Framework: Dynalang redefines AI agents by using language not...
How AutoDeco Eliminates Manual Tweaks and Ushers in Truly End-to-End AI Creativity
Challenging the Status Quo: Current large language models (LLMs) aren't truly "end-to-end" due...
Unraveling the Hidden Dangers of Low-Quality Training Data in the Age of AI
The Brain Rot Hypothesis: Researchers propose and test a theory that exposing...