Articles
Semantics-Oriented Reward Design With "PrefBERT," A New Evaluation Method To Evolve Long Sentence Generation
Semantics-Oriented Reward Design With "PrefBERT," A New Evaluation Method To Evolve Long Sentence Ge ...
The Challenge Of "Embodied Web Agents," The Next Generation AI That Fuses The Physical And Digital
The Challenge Of "Embodied Web Agents," The Next Generation AI That Fuses The Physical And Digital
A New Wave Of Multispeaker Speech Recognition! The Challenge Of High Accuracy Systems By DiCoW And DiariZen
A New Wave Of Multispeaker Speech Recognition! The Challenge Of High Accuracy Systems By DiCoW And D ...
GenRecal, A General-purpose Distillation Framework For Lightweight, High-performance Distillation
GenRecal, A General-purpose Distillation Framework For Lightweight, High-performance Distillation
ProtoReasoning: General-purpose Reasoning Skills Honed Through Logic And Planning
ProtoReasoning: General-purpose Reasoning Skills Honed Through Logic And Planning
A Proposal For Mixed-first Optimization That Revolutionizes The Inference Performance Of Multimodal LLMs!
A Proposal For Mixed-first Optimization That Revolutionizes The Inference Performance Of Multimodal ...
UnifiedCrawl: A New Approach To Low-Resource Language Data Collection And Efficient LLM Adaptation
UnifiedCrawl: A New Approach To Low-Resource Language Data Collection And Efficient LLM Adaptation
Other
OpenScholar: Knowledge Synthesis And Reliability Enhancement Of Scientific Literature With LLM
OpenScholar: Knowledge Synthesis And Reliability Enhancement Of Scientific Literature With LLM
LLMs As Mentors Instead Of Humans? Reinforcement Learning Agents Trained In Natural Language
LLMs As Mentors Instead Of Humans? Reinforcement Learning Agents Trained In Natural Language
Ultra-Sparse Memory Network: A New Method To Change Transformer Memory Efficiency
Ultra-Sparse Memory Network: A New Method To Change Transformer Memory Efficiency
Hymba, A New Architecture That Pushes The Limits Of Small LLMs
Hymba, A New Architecture That Pushes The Limits Of Small LLMs
Insight-V: A New Strategy For Multimodal Reasoning Connecting Vision And Thought
Insight-V: A New Strategy For Multimodal Reasoning Connecting Vision And Thought
Stable Flow: Visualization Of The "really Important Layers" Behind Image Generation
Stable Flow: Visualization Of The "really Important Layers" Behind Image Generation
Open Vocabulary Object Detection Enabled By OWL-ViT
Open Vocabulary Object Detection Enabled By OWL-ViT
Neural Network
SOK-Bench] Situational Video Inference Benchmark Using Real-World Knowledge In Video
SOK-Bench] Situational Video Inference Benchmark Using Real-World Knowledge In Video
Computer Vision
Libra] A New Multimodal Design Of Large Language Models Using Separate Vision Systems
Libra] A New Multimodal Design Of Large Language Models Using Separate Vision Systems
Large Language Models
DrHouse] Diagnostic System Using Sensor Information And Expertise
DrHouse] Diagnostic System Using Sensor Information And Expertise
Medical
Wavelet Diffusion: The Fastest Diffusion Model
Wavelet Diffusion: The Fastest Diffusion Model
Image Generation
Improved Accuracy And Transparency Of Face Recognition With ChatGPT, New Developments In Soft Biometrics
Improved Accuracy And Transparency Of Face Recognition With ChatGPT, New Developments In Soft Biomet ...
Large Language Models
[RL-GPT] A Framework To Acquire Diamonds Several Times Faster Than Usual With Mincraft Is Now Available
[RL-GPT] A Framework To Acquire Diamonds Several Times Faster Than Usual With Mincraft Is Now Availa ...
Machine Learning
A Framework Is Now Available That Allows LLMs To Assess Human Personality Using The MBTI!
A Framework Is Now Available That Allows LLMs To Assess Human Personality Using The MBTI!
ChatGPT
LATM Generates And Executes Extension Tools Using LLM
LATM Generates And Executes Extension Tools Using LLM
Large Language Models
[FinBen] Benchmark To Assess The Capabilities And Limitations Of LLM In The Financial Domain
[FinBen] Benchmark To Assess The Capabilities And Limitations Of LLM In The Financial Domain
Large Language Models
Qwen2.5-Coder] LLM Specialized For Code Generation, Completion, And Mathematical Reasoning Tasks
Qwen2.5-Coder] LLM Specialized For Code Generation, Completion, And Mathematical Reasoning Tasks
Large Language Models
MTAD-GAT Using Graph-attention For Multivariate Time Series Anomaly Detection
MTAD-GAT Using Graph-attention For Multivariate Time Series Anomaly Detection
Time-series
[Swin Transformer] Transformer-based Image Recognition Models To Keep Now!
[Swin Transformer] Transformer-based Image Recognition Models To Keep Now!
Image Recognition
AI Art Vs Human Art -Which Do People Prefer?
AI Art Vs Human Art -Which Do People Prefer?
Image Generation
Easy! High Accuracy! Attractiveness Of The Anomaly Detection Model PatchCore
Easy! High Accuracy! Attractiveness Of The Anomaly Detection Model PatchCore
Object Detection
Learn How OpenAI Trained Its 12-billion Parameter Text-to-image Generator: DALL-E
Learn How OpenAI Trained Its 12-billion Parameter Text-to-image Generator: DALL-E
Deep Learning
Superior To ViT! A New Underlying Model For Large-Scale CNNs! : InternImage
Superior To ViT! A New Underlying Model For Large-Scale CNNs! : InternImage
Deep Learning
[DPO] A Method For Directly Matching Large-scale Language Models To User Preferences Without Using Reinforcement Learning
[DPO] A Method For Directly Matching Large-scale Language Models To User Preferences Without Using R ...
RLHF