    AI Papers

    Unraveling the Digital Playground: Generative AI’s Impact on Children

    Navigating the Promises and Perils of AI in Young Lives. Generative AI is increasingly integrated into children's lives through tools like ChatGPT and DALL-E, with...

    Astra: Robot Navigation with Smart, Adaptive Tech

    How Hierarchical Multimodal Learning Powers the Future of Mobile Robots. Astra introduces a groundbreaking dual-model architecture, Astra-Global and Astra-Local, to tackle the challenges of robot...

    Crafting the Future: PartCrafter’s Pioneering Approach to 3D Mesh Innovation

    Transforming Single Images into Decomposable 3D Models with Unprecedented Precision. PartCrafter introduces a groundbreaking approach to 3D modeling by generating multiple semantically meaningful and geometrically...

    ComfyUI-Copilot: AI Art Creation with Intelligent Automation

    Streamlining Workflow Development for Beginners and Experts Alike. ComfyUI-Copilot is an innovative, large language model (LLM)-powered plugin designed to simplify the complexities of ComfyUI, an...

    Video Creation: The Power of Temporal In-Context Fine-Tuning

    Unlocking Versatile Control in Video Diffusion Models with TIC-FT. Temporal In-Context Fine-Tuning (TIC-FT) introduces a groundbreaking, efficient method for adapting pretrained video diffusion models to...

    DexUMI: Robotics with the Human Hand as the Ultimate Interface

    Bridging the Embodiment Gap for Dexterous Robot Manipulation. DexUMI is a groundbreaking framework that uses the human hand as a universal interface to transfer dexterous...

    The Entropy Enigma: Unlocking the Future of Reasoning in Language Models

    How Managing Policy Entropy Could Revolutionize Reinforcement Learning for LLMs. Policy entropy collapse in reinforcement learning (RL) for large language models (LLMs) severely limits exploratory...

    One RL to See Them All: Unifying Visual Reasoning and Perception in AI

    V-Triune's innovative reinforcement learning system empowers vision-language models to master both complex thought and detailed sight, heralding a new era of versatile AI. Unified Training...

    Robin: Science with AI-Driven Discovery

    How a Multi-Agent System is Automating the Future of Therapeutic Innovation. Robin, the first multi-agent AI system, fully automates the scientific discovery process by integrating...

    Virtual Worlds: CAST and the Future of 3D Scene Reconstruction

    Transforming Single RGB Images into Realistic 3D Environments with Component-Aligned Technology. CAST (Component-Aligned 3D Scene Reconstruction) introduces a groundbreaking method to create high-quality 3D scenes...

    Fast Text-to-Audio Generation with Adversarial Post-Training

    Revolutionizing Audio Creation with Speed and Diversity. Text-to-audio systems, despite their impressive performance, suffer from slow inference times, rendering them impractical for many creative applications,...

    AbsoluteZero: Transforming AI Reasoning with Self-Play and Zero Data

    How the Absolute Zero Paradigm Redefines Learning Without Human Input. The Absolute Zero paradigm introduces a groundbreaking approach to AI reasoning, enabling large language models...