Code Model Training with Reinforcement Learning and Automated Test-Case Generation
Unlocking RL Potential in Code Models: ACECODER addresses the untapped potential of reinforcement learning (RL)...
How Multi-Image Synthetic Data and Shared Attention Mechanisms Are Redefining AI-Generated Imagery
Synthetic Dataset Innovation: A new Synthetic Customization Dataset (SynCD) leverages 3D assets and...
Large Language Models Are Vulnerable to Stealthy Attacks That Undermine Safety Alignment
Guardrails Aren’t Enough: Despite guardrail moderation systems designed to filter harmful data, a...
How Image Diffusion Models Are Now Powering Scalable, High-Fidelity 3D Gaussian Splat Generation
2D to 3D Leap: DiffSplat repurposes massive image diffusion models to generate...
From Script to Screen: The Rise of LLM-Powered Virtual Filmmaking
Automated Creativity: FilmAgent leverages AI agents to mimic traditional film crew roles, streamlining tasks from...
ByteDance introduces a state-of-the-art solution for consistent, high-quality depth estimation in extended videos.
Long-Video Capability: Video Depth Anything tackles the challenge of estimating depth for videos...
An open-source toolkit making advanced audio generation accessible to all.
Accessible Innovation: Amphion simplifies audio, music, and speech generation for beginners and experts alike.
Comprehensive Tools: It supports...
China's DeepSeek R1 delivers state-of-the-art reasoning at a fraction of the cost, setting a new standard for open-source AI innovation.
Revolutionary Reasoning: DeepSeek R1, trained...
A breakthrough framework for generating vibrant Live2D cartoon characters using the power of AI-driven text-to-image technology.
Innovative Character Generation: Textoon transforms textual descriptions into vivid...
A zero-shot pipeline brings lifelike human animation with dynamic details and enhanced realism.
Innovative Animation Pipeline: X-Dyna integrates facial expressions, body movements, and environmental dynamics...
How a Modality-Specific Normalization Strategy is Redefining Scalable and High-Quality Virtual Try-On
Revolutionizing Virtual Try-On (VTON): A new single-network paradigm eliminates the need for dual networks,...