Tech

ViPer: Visual Personalization of Generative Models via Individual Preference Learning

Tailoring AI-Generated Images to Individual Tastes

ViPer personalizes image generation by capturing and applying individual visual preferences. The system uses user comments to infer visual likes...

HumanVid: Demystifying Training Data for Camera-Controllable Human Image Animation

A New Benchmark for Realistic Human Image Animation and Camera Control

HumanVid introduces the first large-scale, high-quality dataset tailored for human image animation, combining real-world...

Kling AI Now Open for Worldwide Users

Kuaishou's AI Video Generator Goes Global, Challenging OpenAI's Sora

Kling AI, Kuaishou's AI video generator, is now available globally, providing text-to-video and image-plus-text-to-video generation. Users receive...

AI, Go Fetch! New NVIDIA NeMo Retriever Microservices Boost LLM Accuracy and Throughput

NVIDIA's Latest Offering Enhances Data Retrieval for AI Applications

NVIDIA introduces NeMo Retriever NIM inference microservices to improve AI accuracy and efficiency. The microservices are integrated...

T2V-CompBench: Setting a New Standard for Compositional Text-to-Video Generation

Introducing the First Comprehensive Benchmark for Complex Video Generation from Text Prompts

T2V-CompBench offers the first benchmark tailored for compositional text-to-video generation. The benchmark includes diverse...

OutfitAnyone: Ultra-high Quality Virtual Try-On for Any Clothing and Any Person

Revolutionizing the Virtual Fashion Experience

OutfitAnyone utilizes a two-stream conditional diffusion model for lifelike virtual try-on experiences. The technology adapts to various body shapes and poses,...

Meta’s New Llama 3.1 AI Model Is Free, Powerful, and Risky

Meta's Open AI Model Spurs Accessibility and Safety Concerns

Meta releases the Llama 3.1 AI model for free, emphasizing accessibility and customization. The model’s open nature sparks...

Apple Showcases Open AI Capabilities with New Models

New Apple AI Models Outperform Competitors Mistral and Hugging Face

Apple releases DCLM models on Hugging Face, featuring 7 billion and 1.4 billion parameter variants. The...

Neo4j Introduces LLM Knowledge Graph Builder for Unstructured Data

Transforming Unstructured Data into Organized Knowledge Graphs

Neo4j's LLM Knowledge Graph Builder converts unstructured data into knowledge graphs. It utilizes a range of powerful machine learning models...

Click-Gaussian: Revolutionizing 3D Segmentation

Real-Time, Fine-Grained 3D Scene Manipulation Made Possible

Click-Gaussian enables rapid and accurate segmentation of 3D Gaussians. The Global Feature-guided Learning (GFL) method enhances segmentation accuracy. The method...

IBM: Scaling Granite Code Models to 128K Context

Expanding Context Windows in Open-Source Code Models

IBM introduces Granite code models supporting up to 128K token context windows. Lightweight continual pretraining and instruction tuning enhance...

AI Enhances Doctor-Patient Communication

Study Shows AI-Generated Responses Outperform Human Doctors in Empathy but Raise Readability Concerns

Superior Empathy: AI-generated responses were rated higher in empathy compared to those written...