More
    HomeAI Papers

    AI Papers

    Motion Anything From Google: Motion Generation with Multimodal Control

    How Attention-Based Mask Modeling and a New Dataset Are Redefining the Future of Motion Synthesis Motion Anything introduces an Attention-Based Mask Modeling approach, enabling fine-grained...

    AI Unlocks Quantum Entanglement: A Leap Toward the Quantum Internet

    How Artificial Intelligence Simplified Einstein’s ‘Spooky Action’ and Paved the Way for Next-Gen Communication AI-Powered Breakthrough: Researchers used PyTheus, an AI tool, to design a...

    EgoLife: The AI Assistant That Sees the World Through Your Eyes

    How Wearable Glasses and Multimodal Data Are Pioneering the Future of Personal Efficiency A 300-Hour Window into Daily Life: The EgoLife Dataset captures six participants’...

    START: AI Reasoning with Self-Taught Tool Mastery

    How a New Self-Learning Framework Combats Hallucinations and Supercharges Problem-Solving in LLMs START integrates external tools like code execution to tackle hallucinations and inefficiencies in Large...

    AppAgentX: The Next Leap in Smartphone Automation—Bridging AI Brains and Efficiency

    How Evolutionary Algorithms Are Teaching AI to Master Your Phone Like a Pro Efficiency Meets Intelligence: AppAgentX merges the adaptability of LLM-based agents with the...

    GHOST 2.0: The Neural Makeover Engine Redefining Identity in the Digital Age

    Generative AI Breaks New Ground with Hyper-Realistic, One-Shot Head Transfers Using Neural Networks Next-Level Realism: GHOST 2.0 tackles the toughest challenges in head swapping—preserving identity,...

    Infinite-Context AI: China’s MoBA Breakthrough

    When Mixture of Experts Meets Sparse Attention - A Paradigm Shift for Enterprise-Grade RAG Systems MoBA's Hybrid Architecture combines MoE efficiency with dynamic attention routing to...

    From Pixels to Reality: How CAST Reconstructs 3D Worlds from a Single Image

    A Breakthrough in AI-Powered Scene Recovery Combines Spatial Intelligence and Physics for Unprecedented Realism Scene Understanding Redefined: CAST combines 2D segmentation, depth analysis, and GPT-powered...

    The Hidden Flaw in AI Safety: Why Safeguarded Ships Run Aground

    Aligned Large Language Models’ Safety Mechanisms Tend to Be Anchored in The Template Region Template-Anchored Vulnerabilities: Safety-aligned LLMs rely disproportionately on fixed response templates, creating...

    Face Readers: How AI is Unlocking the Emotional Lives of Animals

    Artificial intelligence is revolutionizing animal welfare by interpreting emotions like pain, stress, and even happiness. Could this technology help us better care for animals—and...

    Beyond the Hype: Why Long-Context AI Models Struggle with Real Reasoning

    NOLIMA Exposes Critical Flaws in LLMs’ Ability to Infer and Link Information in Extended Texts Literal Matches ≠ Real Intelligence: While LLMs ace "needle-in-a-haystack" tests...

    Fast Video Generation with Sliding Tile Attention

    Revolutionizing Video Diffusion with Efficiency and Speed Sliding Tile Attention (STA) drastically reduces the computational cost of video generation in Diffusion Transformers (DiTs) by focusing...