More
    HomeAI Papers

    AI Papers

    FlipSketch: Breathing Life Into Your Doodles

    Sketch animation with AI-powered simplicity and creativity. Effortless Animation: FlipSketch transforms static sketches into smooth animations with just a drawing and a text description. AI Innovation: Combines text-to-video...

    RedPajama: The Future of Transparent and Open-Source Language Model Training

    How RedPajama datasets are redefining AI development with transparency, scalability, and versatility. Transparency in AI Training: RedPajama introduces an unprecedented level of openness in dataset composition,...

    AnimateAnything: Transforming Video Creation with Seamless Control and Precision

    The groundbreaking framework for consistent, customizable video generation opens new doors for filmmakers and VR designers. Versatile Control: AnimateAnything enables precise video manipulation through camera trajectories,...

    SAMPart3D: A Breakthrough in Zero-Shot 3D Object Segmentation for Complex Models

    Achieving scalable and flexible part-level segmentation without text prompts, SAMPart3D enables advanced 3D editing and model customization. Text-Free, Scalable Segmentation: SAMPart3D removes the need for...

    OMNI-EDIT: The Ultimate Image Editor with Multi-Task Capabilities for Any Aspect Ratio

    OMNI-EDIT leverages specialist guidance to tackle seven unique editing tasks, achieving unprecedented accuracy and quality in real-world image editing. Multi-Task Capability: OMNI-EDIT is designed to...

    Introducing StdGEN: Game-Changing 3D Character Generation from Single Images with Full Semantic Control

    StdGEN offers an advanced pipeline for high-quality, semantically decomposed 3D characters ready for gaming, VR, and film production. Fast, High-Quality 3D Generation: StdGEN creates intricately...

    TIP-I2V: The World’s Largest Dataset for Image-to-Video AI Research

    A Million-Scale Dataset Brings New Potential to Image-to-Video Generation Models Unprecedented Scale and Scope: TIP-I2V introduces over 1.7 million unique text and image prompts for...

    X-Portrait 2: Expressive Portrait Animation with Next-Level Realism and Emotion

    From subtle smirks to bold expressions, X-Portrait 2 transforms static images into lifelike animations for film, virtual agents, and more Advanced Expression Encoding: X-Portrait 2...

    GarVerseLOD: 3D Garment Reconstruction from a Single Image with High-Fidelity Detail Levels

    This groundbreaking dataset and framework achieve robust garment modeling from in-the-wild images, addressing challenges of complex poses and deformations. Advanced Dataset: GarVerseLOD introduces a large-scale...

    Microsoft’s Magentic-One: The New Open-Source AI Platform Redefining Autonomous Task Management

    With its new modular architecture, Magentic-One tackles complex tasks across domains, promising a future of AI-driven workflows. Multi-Agent Capabilities: Magentic-One uses a modular, multi-agent design,...

    HelloMeme Meme Video Creation with Spatial Knitting Attentions in Diffusion Models

    New AI Method Embeds High-Fidelity Visuals and Exaggerated Expressions, Opening Doors for Creative and Open-Source Applications Spatial Knitting Attentions: HelloMeme introduces spatial knitting attention mechanisms...

    Unlocking Multi-Intent Detection: Qualcomm’s Pointer Network AI Conversations

    A New Approach to Extract and Identify Multiple Intents in Complex User Queries Qualcomm’s research introduces a Pointer Network-based system designed to handle multiple intents within a...