AI Papers

START: AI Reasoning with Self-Taught Tool Mastery

How a New Self-Learning Framework Combats Hallucinations and Supercharges Problem-Solving in LLMs
START integrates external tools like code execution to tackle hallucinations and inefficiencies in Large...

AppAgentX: The Next Leap in Smartphone Automation—Bridging AI Brains and Efficiency

How Evolutionary Algorithms Are Teaching AI to Master Your Phone Like a Pro
Efficiency Meets Intelligence: AppAgentX merges the adaptability of LLM-based agents with the...

GHOST 2.0: The Neural Makeover Engine Redefining Identity in the Digital Age

Generative AI Breaks New Ground with Hyper-Realistic, One-Shot Head Transfers Using Neural Networks
Next-Level Realism: GHOST 2.0 tackles the toughest challenges in head swapping—preserving identity,...

Infinite-Context AI: China’s MoBA Breakthrough

When Mixture of Experts Meets Sparse Attention - A Paradigm Shift for Enterprise-Grade RAG Systems
MoBA's Hybrid Architecture combines MoE efficiency with dynamic attention routing to...

From Pixels to Reality: How CAST Reconstructs 3D Worlds from a Single Image

A Breakthrough in AI-Powered Scene Recovery Combines Spatial Intelligence and Physics for Unprecedented Realism
Scene Understanding Redefined: CAST combines 2D segmentation, depth analysis, and GPT-powered...

The Hidden Flaw in AI Safety: Why Safeguarded Ships Run Aground

Aligned Large Language Models’ Safety Mechanisms Tend to Be Anchored in The Template Region
Template-Anchored Vulnerabilities: Safety-aligned LLMs rely disproportionately on fixed response templates, creating...

Face Readers: How AI is Unlocking the Emotional Lives of Animals

Artificial intelligence is revolutionizing animal welfare by interpreting emotions like pain, stress, and even happiness. Could this technology help us better care for animals—and...

Beyond the Hype: Why Long-Context AI Models Struggle with Real Reasoning

NOLIMA Exposes Critical Flaws in LLMs’ Ability to Infer and Link Information in Extended Texts
Literal Matches ≠ Real Intelligence: While LLMs ace "needle-in-a-haystack" tests...

Fast Video Generation with Sliding Tile Attention

Revolutionizing Video Diffusion with Efficiency and Speed
Sliding Tile Attention (STA) drastically reduces the computational cost of video generation in Diffusion Transformers (DiTs) by focusing...

OmniHuman-1: The Future of AI-Generated Human Animation

Can ByteDance’s Breakthrough Outperform OpenAI’s Sora and Google’s Veo?
OmniHuman-1 is a revolutionary AI model that transforms a single image into a lifelike video of a...

AI Predicts Cancer Outcomes Using Clinical Notes and Genomic Data

How Artificial Intelligence is Transforming Cancer Prognosis and Treatment
AI-powered models using clinical notes and genomic data can predict cancer survival and treatment outcomes with...

ACECODER: Acing Coder RL via Automated Test-Case Synthesis

Code Model Training with Reinforcement Learning and Automated Test-Case Generation
Unlocking RL Potential in Code Models: ACECODER addresses the untapped potential of reinforcement learning (RL)...