Cross-Layer Attention Significantly Reduces Transformer Memory Cross-Layer Attention Significantly Reduces Transformer Memory 10/12/2024 Transformer
RiceChem] Dataset For Evaluating Automated Long-form Grading (ALAG) By LLM RiceChem] Dataset For Evaluating Automated Long-form Grading (ALAG) By LLM 26/11/2024 Large Language Models
Qwen2.5-Coder] LLM Specialized For Code Generation, Completion, And Mathematical Reasoning Tasks Qwen2.5-Coder] LLM Specialized For Code Generation, Completion, And Mathematical Reasoning Tasks 11/10/2024 Large Language Models
A Better Attention Mechanism Will Improve The Performance Of LLM's Long-text Processing! A Better Attention Mechanism Will Improve The Performance Of LLM's Long-text Processing! 30/09/2024 Large Language Models
Kolmogorov-Arnold Network (KAN) Instead Of MLP To Improve Model Expressiveness And Performance Kolmogorov-Arnold Network (KAN) Instead Of MLP To Improve Model Expressiveness And Performance 24/09/2024 Large Language Models
Improved Diagnostic Accuracy, New Diagnostic Support Through Medically Specialized LLM Improved Diagnostic Accuracy, New Diagnostic Support Through Medically Specialized LLM 31/07/2024 Large Language Models
[Chat-REC] Proposal For LLM-based Recommendation System [Chat-REC] Proposal For LLM-based Recommendation System 24/07/2024 Recommendation
A Method To Automatically Evaluate "the Accuracy Of LLM's Output Of Long Sentences" Was Created A Method To Automatically Evaluate "the Accuracy Of LLM's Output Of Long Sentences" Was Created 01/07/2024 Large Language Models
[AlphaCodium] Highest Performance Code Generation Method Specialized For Programming [AlphaCodium] Highest Performance Code Generation Method Specialized For Programming 30/05/2024 Large Language Models
The First Framework To Utilize LLM To Detect Fake News Is Now Available! The First Framework To Utilize LLM To Detect Fake News Is Now Available! 26/05/2024 Fakenews
Limitations And Solutions For Data-constrained LLM Limitations And Solutions For Data-constrained LLM 05/03/2024 Large Language Models
[DPO] A Method For Directly Matching Large-scale Language Models To User Preferences Without Using Reinforcement Learning [DPO] A Method For Directly Matching Large-scale Language Models To User Preferences Without Using R ... 02/02/2024 RLHF
Apple's Efficient Inference Of Large Language Models On Devices With Limited Memory Capacity Apple's Efficient Inference Of Large Language Models On Devices With Limited Memory Capacity 29/01/2024 Large Language Models
[CoDi] Any-to-any Diffusion Model That Can Handle Almost Any Modality [CoDi] Any-to-any Diffusion Model That Can Handle Almost Any Modality 12/01/2024 Diffusion Model
What Is A Good Vocabulary In Machine Translation? What Is A Good Vocabulary In Machine Translation? 05/12/2023 Natural Language Processing
LP-MusicCaps] Automatic Generation Of Music Captions Using LLM LP-MusicCaps] Automatic Generation Of Music Captions Using LLM 20/11/2023 Contrastive Learning
Mind's Eye: Using Simulation To Improve Physical Reasoning Ability Prompt Extension Mind's Eye: Using Simulation To Improve Physical Reasoning Ability Prompt Extension 27/09/2023 Large Language Models