A New Benchmark Tests AI’s Ability to Answer Real-Time Visual Questions
Introducing LIVEVQA – A groundbreaking dataset of 3,602 visual questions sourced from live news, designed...
Unleashing the Power of Diffusion Priors for Consistent Geometry Estimation
GeometryCrafter introduces a novel framework that recovers high-fidelity point map sequences with temporal coherence from...
Transforming Text into Meshes in Seconds with Stable Diffusion
PRD enables the adaptation of SD into a native 3D generator, eliminating the need for 3D...
How URAE is Redefining Text-to-Image Diffusion Models with Data and Parameter Efficiency
Synthetic Data as a Game-Changer: URAE leverages synthetic data from teacher models to significantly...
A Breakthrough in Equivariant Tightness Fitting for Clothed Humans Promises Unprecedented Accuracy and Generalization
ETCH introduces a novel pipeline that leverages equivariant tightness vectors to map...
Revolutionizing AI-Generated Content: Trajectory Distribution Matching for Few-Step Diffusion Models
A Unified Paradigm for Diffusion Models: Huawei’s Trajectory Distribution Matching (TDM) bridges the gap between distribution...
How Attention-Based Mask Modeling and a New Dataset Are Redefining the Future of Motion Synthesis
Motion Anything introduces an Attention-Based Mask Modeling approach, enabling fine-grained...
How Artificial Intelligence Simplified Einstein’s ‘Spooky Action’ and Paved the Way for Next-Gen Communication
AI-Powered Breakthrough: Researchers used PyTheus, an AI tool, to design a...
How Wearable Glasses and Multimodal Data Are Pioneering the Future of Personal Efficiency
A 300-Hour Window into Daily Life: The EgoLife Dataset captures six participants’...
How a New Self-Learning Framework Combats Hallucinations and Supercharges Problem-Solving in LLMs
START integrates external tools like code execution to tackle hallucinations and inefficiencies in Large...
How Evolutionary Algorithms Are Teaching AI to Master Your Phone Like a Pro
Efficiency Meets Intelligence: AppAgentX merges the adaptability of LLM-based agents with the...
Generative AI Breaks New Ground with Hyper-Realistic, One-Shot Head Transfers Using Neural Networks
Next-Level Realism: GHOST 2.0 tackles the toughest challenges in head swapping—preserving identity,...