AI News

Super-Sized Transformers: Scaling BERT to 1M Tokens and Beyond

Recurrent Memory Transformer Enables Unprecedented Context Length in NLP Models
Researchers have applied recurrent memory to BERT, extending the model's context length to an impressive...

Anxiety in AI: How Emotions Influence Large Language Models

Study Reveals Anxiety-Inducing Prompts Increase Exploration and Bias in GPT-3.5
GPT-3.5 displays higher anxiety scores than human subjects when subjected to a common anxiety questionnaire. Emotion-inducing...

CancerGPT: A Leap Forward in Few-Shot Drug Pair Synergy Prediction

Large Pre-trained Language Models Tackle Biological Inference for Rare Tissues with Limited Data
CancerGPT, a few-shot learning approach, uses LLMs to predict drug pair synergy...

Uncovering Alignment Limitations in Large Language Models

Behavior Expectation Bounds Framework Reveals Fundamental Challenges in AI Safety
Behavior Expectation Bounds (BEB) framework introduced to investigate inherent characteristics and limitations of alignment in...

Meta AI DINOv2: The Self-Supervised Vision Transformer Revolution

Multipurpose Backbone for a Wide Range of Computer Vision Tasks Without Fine-tuning
Key Points: Meta AI announces DINOv2, a self-supervised vision transformer model for various computer...

Graphologue & Sensecape: A Game-Changer for Human-AI Interaction

UCSD Researchers Develop a Tool That Turns GPT-4 Text into Real-time Interactive Node-link Diagrams
UCSD researchers have developed Graphologue to address the limitations of text-based...

Google CEO’s Stark Warning on AI Acceleration

Sundar Pichai Urges Society to Prepare for the Far-Reaching Impact of Artificial Intelligence
Key Points: Google CEO Sundar Pichai emphasizes that AI will impact "every product...

ImpressionGPT: A Breakthrough in Radiology Report Summarization

Leveraging Large Language Models for Contextual Learning and Iterative Optimization in Medical Imaging Reports
Key Points: ImpressionGPT leverages ChatGPT's in-context learning capability, utilizing dynamic prompts and...

The Impact of Code in Pretraining LLMs: Dissecting the Evolution of ChatGPT

Exploring the connection between code in pretraining and the emergent abilities of large language models
Key Points: GPT-3.5 models, including ChatGPT, exhibit impressive capabilities, raising questions...

MiniGPT-4: Exploring Advanced Vision-Language Understanding with Large Language Models

Unveiling the capabilities and efficiency of MiniGPT-4 in various vision-language tasks
Key Points: MiniGPT-4 aligns a frozen visual encoder with a frozen LLM, Vicuna, using just...

The Power of LLMs for Zero-Shot Next-Item Recommendations

A novel prompting strategy opens up new possibilities for LLMs in the recommendation domain
Large language models (LLMs), such as GPT-3, have shown remarkable zero-shot...

WebBrain: A Groundbreaking Approach to Generating Factual Articles

New NLP task and dataset set the stage for improved information extraction and generation
In the new paper WebBrain: Learning to Generate Factually Correct Articles...