Articles
AlignGuard-LoRA: A New Regularization Method That Combines Efficient Fine-Tuning And Safety Preservation
AlignGuard-LoRA: A New Regularization Method That Combines Efficient Fine-Tuning And Safety Preserva ...
ChartCap: Suppressing Chart Captioning Hallucinations With Large Data Sets And New Evaluation Indexes
ChartCap: Suppressing Chart Captioning Hallucinations With Large Data Sets And New Evaluation Indexe ...
LAMIC: A Learning-free, Layout-controllable, Multi-reference Image Generation Method
LAMIC: A Learning-free, Layout-controllable, Multi-reference Image Generation Method
LiveMCPBench: A New Benchmark For Evaluating LLM Agents In Large Tool Environments
LiveMCPBench: A New Benchmark For Evaluating LLM Agents In Large Tool Environments
Goedel-Prover-V2: New Developments In Efficient Automated Theorem Proving By Self-Correction And Stepwise Data Synthesis
Goedel-Prover-V2: New Developments In Efficient Automated Theorem Proving By Self-Correction And Ste ...
New Developments In Multi-person Conversation Video Generation! MIT Dataset And Baseline Model "CovOG
New Developments In Multi-person Conversation Video Generation! MIT Dataset And Baseline Model "CovO ...
ToolTrain: A New Method For Repository Deep Search And Issue Localization With LLM
ToolTrain: A New Method For Repository Deep Search And Issue Localization With LLM
Mechanism And Effect Of "Representation Shift" Token Compression For FlashAttention
Mechanism And Effect Of "Representation Shift" Token Compression For FlashAttention
CRINN: Automatic Optimization Of Approximate Nearest Neighbor Search Algorithms Using Reinforcement Learning
CRINN: Automatic Optimization Of Approximate Nearest Neighbor Search Algorithms Using Reinforcement ...
CompassVerifier: A New Benchmark And Robust Model To Revolutionize LLM Solution Verification
CompassVerifier: A New Benchmark And Robust Model To Revolutionize LLM Solution Verification
LongVie: A New Era Of 1-minute Ultra-High Quality Video Generation Realized By Multimodal Control
LongVie: A New Era Of 1-minute Ultra-High Quality Video Generation Realized By Multimodal Control
Skywork UniPic: Next-generation Multimodal Model That Integrates Image Understanding, Generation, And Editing With High Efficiency
Skywork UniPic: Next-generation Multimodal Model That Integrates Image Understanding, Generation, An ...
Seed Diffusion Preview: Next-generation Code Generation Model That Combines Fast Inference And High Performance
Seed Diffusion Preview: Next-generation Code Generation Model That Combines Fast Inference And High ...
MATE: Multi-agent Accessibility-specific Modality Transformation Framework
MATE: Multi-agent Accessibility-specific Modality Transformation Framework
Biomed-Enriched: Large Biomedical Dataset With LLM Annotation For Clinical And Educational Value
Biomed-Enriched: Large Biomedical Dataset With LLM Annotation For Clinical And Educational Value
How Many Times Is Debugging LLM Effective? What Is The New Indicator "DDI" To Detect The Decay Of Effectiveness?
How Many Times Is Debugging LLM Effective? What Is The New Indicator "DDI" To Detect The Decay Of Ef ...
Combining Speed And Accuracy: Quantization-aware LLM Pre-training "QAP
Combining Speed And Accuracy: Quantization-aware LLM Pre-training "QAP
Wavelet Diffusion: The Fastest Diffusion Model
Wavelet Diffusion: The Fastest Diffusion Model
Image Generation
Improved Accuracy And Transparency Of Face Recognition With ChatGPT, New Developments In Soft Biometrics
Improved Accuracy And Transparency Of Face Recognition With ChatGPT, New Developments In Soft Biomet ...
Large Language Models
[RL-GPT] A Framework To Acquire Diamonds Several Times Faster Than Usual With Mincraft Is Now Available
[RL-GPT] A Framework To Acquire Diamonds Several Times Faster Than Usual With Mincraft Is Now Availa ...
Machine Learning
A Framework Is Now Available That Allows LLMs To Assess Human Personality Using The MBTI!
A Framework Is Now Available That Allows LLMs To Assess Human Personality Using The MBTI!
ChatGPT
Mask R-CNN: Efficient Detection Of Objects In Images
Mask R-CNN: Efficient Detection Of Objects In Images
Computer Vision
The First Framework To Utilize LLM To Detect Fake News Is Now Available!
The First Framework To Utilize LLM To Detect Fake News Is Now Available!
Fakenews
Graphs Are So Awesome! Review Of Integration With Deep Learning
Graphs Are So Awesome! Review Of Integration With Deep Learning
GNN
Democratizing GPT-4o Level Image Generation: The Janus-4o And ShareGPT-4o-Image Challenge
Democratizing GPT-4o Level Image Generation: The Janus-4o And ShareGPT-4o-Image Challenge
LLM Agents Successfully Lead Customers To Purchase 35% Of The Time!
LLM Agents Successfully Lead Customers To Purchase 35% Of The Time!
ChatGPT
Seed Diffusion Preview: Next-generation Code Generation Model That Combines Fast Inference And High Performance
Seed Diffusion Preview: Next-generation Code Generation Model That Combines Fast Inference And High ...
Superior To ViT! A New Underlying Model For Large-Scale CNNs! : InternImage
Superior To ViT! A New Underlying Model For Large-Scale CNNs! : InternImage
Deep Learning
ImageReward: A Reward Model That Learns Human Evaluation In Text-to-image
ImageReward: A Reward Model That Learns Human Evaluation In Text-to-image
Alignment
RLHF: How To Train Reinforcement Learning Agents Using Human Evaluation
RLHF: How To Train Reinforcement Learning Agents Using Human Evaluation
Alignment
Integration Of Large-scale Language Models In HCI Research And Ethical Issues
Integration Of Large-scale Language Models In HCI Research And Ethical Issues
Large Language Models