More

    Tech

    Google’s Gemini AI Goes Standalone on iPhone, Bringing Full AI Experience

    The new Gemini app for iPhone expands on previous integrations, adding Gemini Live, voice interaction, and premium features for advanced AI functionality. Standalone Gemini App:...

    AI Giants Face “Diminishing Returns” in Quest for Next-Gen Models as Apple Pursues Conservative Strategy

    OpenAI, Google, and Anthropic encounter hurdles in AI advancement, while Apple's focused approach may offer a sustainable path forward. Performance Challenges in New AI Models:...

    SAMPart3D: A Breakthrough in Zero-Shot 3D Object Segmentation for Complex Models

    Achieving scalable and flexible part-level segmentation without text prompts, SAMPart3D enables advanced 3D editing and model customization. Text-Free, Scalable Segmentation: SAMPart3D removes the need for...

    OMNI-EDIT: The Ultimate Image Editor with Multi-Task Capabilities for Any Aspect Ratio

    OMNI-EDIT leverages specialist guidance to tackle seven unique editing tasks, achieving unprecedented accuracy and quality in real-world image editing. Multi-Task Capability: OMNI-EDIT is designed to...

    Introducing StdGEN: Game-Changing 3D Character Generation from Single Images with Full Semantic Control

    StdGEN offers an advanced pipeline for high-quality, semantically decomposed 3D characters ready for gaming, VR, and film production. Fast, High-Quality 3D Generation: StdGEN creates intricately...

    Amazon Invests Up to $4 Billion in Anthropic to Boost AI Advancements

    Amazon’s investment secures a minority stake in Anthropic, deepening ties and prioritizing AWS for AI model development and safety research. Strategic Investment and Ownership: Amazon’s...

    TIP-I2V: The World’s Largest Dataset for Image-to-Video AI Research

    A Million-Scale Dataset Brings New Potential to Image-to-Video Generation Models Unprecedented Scale and Scope: TIP-I2V introduces over 1.7 million unique text and image prompts for...

    X-Portrait 2: Expressive Portrait Animation with Next-Level Realism and Emotion

    From subtle smirks to bold expressions, X-Portrait 2 transforms static images into lifelike animations for film, virtual agents, and more Advanced Expression Encoding: X-Portrait 2...

    GarVerseLOD: 3D Garment Reconstruction from a Single Image with High-Fidelity Detail Levels

    This groundbreaking dataset and framework achieve robust garment modeling from in-the-wild images, addressing challenges of complex poses and deformations. Advanced Dataset: GarVerseLOD introduces a large-scale...

    Microsoft’s Magentic-One: The New Open-Source AI Platform Redefining Autonomous Task Management

    With its new modular architecture, Magentic-One tackles complex tasks across domains, promising a future of AI-driven workflows. Multi-Agent Capabilities: Magentic-One uses a modular, multi-agent design,...

    The Rise of AI-Generated Images in Science: A New Challenge for Research Integrity

    As generative AI creates convincing scientific images, publishers and experts develop tools to detect and combat potential fraud. Growing Threat of AI-Generated Fakes: Generative AI...

    Apple Users Can Soon Access ChatGPT Plus Directly from iOS Settings

    With iOS 18.2, Apple brings ChatGPT Plus to its ecosystem, offering users a seamless way to upgrade while sparking questions about the financial dynamics...