Exploring How AI-Generated Humor Measures Up Against Professional Satire
AI vs. Human Humor: ChatGPT's jokes were rated as equally funny or funnier than human-generated jokes,...
Unveiling New Possibilities in Text-Image Comprehension and Composition
Enhanced Vision-Language Comprehension: IXC-2.5 supports ultra-high resolution and fine-grained video understanding, along with multi-turn multi-image dialogue.
Extended Contextual...
A Benchmark for Analyzing the Foundations of Visual Mathematical Reasoning
Benchmark Introduction: WE-MATH is the first benchmark focused on the problem-solving principles behind LMMs' performance,...
New dataset boosts medical capabilities of large language models
PubMedVision dataset refines medical image-text pairs to enhance multimodal large language models (MLLMs).
HuatuoGPT-Vision, trained on PubMedVision,...
AI-generated answers achieve higher grades and evade detection in university exams
AI-generated answers scored higher than those of real students in undergraduate exams.
94% of AI essays went...
The rise of AI vocal technology could upend creative fields like audiobooks and voice acting
AI voice clones are beginning to replace human voice actors...
Amazon bolsters its AGI development with strategic hires and licensing agreements
Strategic Hiring and Licensing: Amazon hires key executives from Adept and licenses its AI...
Integrating pixel-level understanding with powerful reasoning for advanced multimodal interactions
Unified Model Architecture: OMG-LLaVA combines image-level, object-level, and pixel-level reasoning within a single framework, enhancing...
A breakthrough in 3D generation with text-to-image diffusion models
YOUDREAM generates high-quality, anatomically controllable 3D animals using a text-to-image diffusion model guided by 2D views...
Florence-2 integrates diverse vision and vision-language tasks through a novel prompt-based model
Florence-2 utilizes a unified, prompt-based approach for various vision and vision-language tasks.
The model...
Apple introduces AI advancements with on-device and server models for enhanced user experience
Apple unveils a 3-billion-parameter on-device language model and a more...