Natural Language Processing
Hymba, A New Architecture That Pushes The Limits Of Small LLMs
Insight-V: A New Strategy For Multimodal Reasoning Connecting Vision And Thought
Construction And Analysis Of The "TruthEval" Dataset To Expose LLM Weaknesses
Large Language Models
SportQA, A New Dataset That Measures The Comprehension Of Sports In Large Language Models
Large Language Models
CLAP-IPA: Acquisition Of Multilingual Phonetic Expressions By Contrastive Learning Of Speech And IPA Sequences
Natural Language Processing
The Future Of Music Education, Flute X GPT And LAUI's Potential To Change Large-Scale Language Models
Large Language Models
Cross-Layer Attention Significantly Reduces Transformer Memory
Transformer
[RiceChem] Dataset For Evaluating Automated Long-form Grading (ALAG) By LLM
Large Language Models
[Qwen2.5-Coder] LLM Specialized For Code Generation, Completion, And Mathematical Reasoning Tasks
Large Language Models
A Better Attention Mechanism Will Improve The Performance Of LLM's Long-text Processing!
Large Language Models
Kolmogorov-Arnold Network (KAN) Instead Of MLP To Improve Model Expressiveness And Performance
Large Language Models
Improved Diagnostic Accuracy, New Diagnostic Support Through Medically Specialized LLM
Large Language Models
[Chat-REC] Proposal For LLM-based Recommendation System
Recommendation
A Method To Automatically Evaluate "the Accuracy Of LLM's Output Of Long Sentences" Was Created
Large Language Models
[AlphaCodium] Highest Performance Code Generation Method Specialized For Programming
Large Language Models
The First Framework To Utilize LLM To Detect Fake News Is Now Available!
Fakenews
Limitations And Solutions For Data-constrained LLM
Large Language Models