More
    HomeAI Papers

    AI Papers

    MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequences

    Revolutionizing Long-Form Video Generation with MovieDreamer MovieDreamer combines autoregressive models and diffusion rendering for long-duration video generation. The framework ensures narrative coherence and character consistency across...

    OutfitAnyone: Ultra-high Quality Virtual Try-On for Any Clothing and Any Person

    Revolutionizing the Virtual Fashion Experience OutfitAnyone utilizes a two-stream conditional diffusion model for lifelike virtual try-on experiences. The technology adapts to various body shapes and poses,...

    Apple Showcases Open AI Capabilities with New Models

    New Apple AI Models Outperform Competitors Mistral and Hugging Face Apple releases DCLM models on Hugging Face, featuring 7 billion and 1.4 billion parameter variants. The...

    Click-Gaussian: Revolutionizing 3D Segmentation

    Real-Time, Fine-Grained 3D Scene Manipulation Made Possible Click-Gaussian enables rapid and accurate segmentation of 3D Gaussians. The Global Feature-guided Learning (GFL) method enhances segmentation accuracy. The method...

    IBM: Scaling Granite Code Models to 128K Context

    Expanding Context Windows in Open-Source Code Models IBM introduces Granite code models supporting up to 128K token context windows. Lightweight continual pretraining and instruction tuning enhance...

    AI Enhances Doctor-Patient Communication

    Study shows AI-generated responses outperform human doctors in empathy but raise readability concerns Superior Empathy: AI-generated responses rated higher in empathy compared to those written...

    Advancing Humanoid Control for Diverse Object Manipulation

    New method enhances simulated humanoid's ability to grasp and transport varied objects Innovative Control Method: Introduces a controller for simulated humanoids to grasp and follow...

    GRUtopia: Revolutionizing 3D Robotics with Simulated City-Scale Environments

    New Simulation Platform Enhances Robot Training and Performance in Diverse Real-World Scenario Advanced Simulation for Robots: GRUtopia offers a simulated 3D society with diverse, interactive...

    StyleSplat Revolutionizes 3D Object Style Transfer

    New Method Enhances Creative Expression with Quick and Customizable 3D Stylization Innovative 3D Style Transfer: StyleSplat introduces a lightweight method for applying artistic styles to...

    NATO’s Revised AI Strategy Aims for Safe and Responsible Use of Advanced Technologies

    New Focus on AI Interoperability, Responsible Use, and Combating Disinformation Enhanced Interoperability and Collaboration: NATO's updated strategy emphasizes interoperability between AI systems and closer cooperation...

    WildGaussians: Advancing 3D Scene Reconstruction in Real-World Environments

    Tackling Occlusions and Dynamic Changes for Photorealistic 3D Rendering Innovative Approach: WildGaussians integrates robust DINO features with 3D Gaussian Splatting for real-time, photorealistic 3D scene...

    SEED-Story: Advancing Multimodal Long Story Generation with AI

    Integrating Text and Images Seamlessly for Enhanced Storytelling Innovative Multimodal Approach: SEED-Story uses a Multimodal Large Language Model (MLLM) to generate coherent, long sequences of...