Language Generation
Hymba, A New Architecture That Pushes The Limits Of Small LLMs
Insight-V: A New Strategy For Multimodal Reasoning Connecting Vision And Thought
PosterLlama: Ability To Design Language Models And Generate Content-aware Layouts
Layout-gen
Limitations And Possibilities Of Large-Scale Language Models In Vietnamese High School Chemistry Exam Questions
Large Language Models
YesBut: The Emergence Of A Dataset That Makes The VLM Understand Irony And Caricature!
Dataset
[InfiMM-WebMath-40B] Improves The Mathematical Performance Of LLM With A Dataset Consisting Of 2.4 Billion Mathematical Documents!
Datasets
AI To Transform Mathematics Education; Possibilities And Challenges Of Solving Mathematical Problems Using Large-Scale Language Mo ...
Large Language Models
[Qwen2.5-Coder] LLM Specialized For Code Generation, Completion, And Mathematical Reasoning Tasks
Large Language Models
[NVLM] Multimodal LLM Outperforms GPT-4o In Image And Language Tasks
Large Language Models
[beeFormer] Transformer Is Trained By Combining Text Information And Interaction Data In The Recommendation System
Large Language Models
[RetrievalAttention] Improved Efficiency Of LLM For Processing Long Contexts!
Large Language Models
[BitNet b1.58] Achieved Accuracy Better Than Llama By Expressing Model Parameters In Three Values!
Large Language Models
[Mustango] Music Generation Model Utilizing Domain Knowledge Of Music
Audio And Speech Processing
[VoiceCraft] A Language Model That Synthesizes Natural Speech At The Highest Level In The Industry
Speech Synthesis
A Method To Automatically Evaluate "the Accuracy Of LLM's Output Of Long Sentences" Was Created
Large Language Models
[AlphaCodium] Highest Performance Code Generation Method Specialized For Programming
Large Language Models
OpenToM, A Benchmark For Evaluating Whether An LLM Has A "theory Of Mind," Is Now Available!
Datasets