More
    HomeAI Papers

    AI Papers

    YOUDREAM: Text-to-3D Animal Generation

    A breakthrough in 3D generation with text-to-image diffusion models YOUDREAM generates high-quality, anatomically controllable 3D animals using a text-to-image diffusion model guided by 2D views...

    Florence-2: Vision Tasks with Unified Representation

    Florence-2 integrates diverse vision and vision-language tasks through a novel prompt-based model. Florence-2 utilizes a unified, prompt-based approach for various vision and vision-language tasks. The model...

    Blood Test Could Predict Parkinson’s Seven Years Before Symptoms

    Early detection aims to revolutionize treatment and prevention Advanced Detection: A new blood test developed by researchers can predict Parkinson’s disease up to seven years before...

    Glyph-ByT5-v2 Sets New Standard for Multilingual Text Rendering

    The breakthrough in visual text rendering supports 10 languages with improved aesthetic quality Multilingual Capability: Glyph-ByT5-v2 and Glyph-SDXL-v2 accurately render text in 10 languages. Enhanced Aesthetics: The models...

    People Struggle to Differentiate Between Humans and AI in Five-Minute Chats

    The Surprising Accuracy of GPT-4 in Mimicking Human Conversation Confounding Conversations: In Turing test experiments, participants mistook GPT-4 for a human 54% of the time. Rising Concerns: The...

    ChatGPT’s Impact on the Freelance Market: A Looming Challenge

    How AI is Reshaping the Demand for Digital Freelancers Significant Decline: A 21% drop in demand for digital freelancers in writing and coding since ChatGPT's launch. Automation-Prone...

    EMMA Image Generation with Multi-Modal Prompts

    How the New AI Model Balances Text and Visual Inputs for Superior Results EMMA integrates multi-modal prompts, combining text with additional visual cues for image...

    Can AI Understand Commonsense?

    Challenging Text-to-Image Models with Real-Life Scenarios Commonsense-T2I evaluates if text-to-image models can produce images based on common sense. Current state-of-the-art models struggle with accuracy, highlighting a...

    Physics3D: 3D Object Simulation with Video Diffusion Models

    Bridging the Gap Between Real and Virtual Physics Physics3D integrates physical properties into 3D object modeling for realistic simulations. The method utilizes a video diffusion model...

    Imitative Editing: Transforming Image Editing with MimicBrush

    Image Editing with AI-Powered Imitative Techniques MimicBrush introduces "imitative editing," allowing users to edit images using reference images without the need for detailed text descriptions. The...

    Microsoft presents VALL-E 2: The Next Step in Zero-Shot Text-to-Speech Synthesis

    Achieving Human Parity with Advanced Neural Codec Language Models Human Parity Achieved: VALL-E 2 marks the first instance of achieving human parity in zero-shot text-to-speech synthesis. Enhanced...

    Advancing AI Art: Multistep Consistency Distillation of Latent Diffusion Models

    New Model Enhances Image Synthesis Speed and Quality Unified Model: MLCM offers a single model for various sampling steps, improving efficiency. Progressive Training: Enhances inter-segment consistency, boosting image...