Catch up on the latest AI articles

What is AI-SCHOLAR?

Skywork UniPic: Next-generation Multimodal Model That Integrates Image Understanding, Generation, And Editing With High Efficiency

Skywork UniPic: Next-generation Multimodal Model That Integrates Image Understanding, Generation, An ...

Seed Diffusion Preview: Next-generation Code Generation Model That Combines Fast Inference And High Performance

Seed Diffusion Preview: Next-generation Code Generation Model That Combines Fast Inference And High ...

Combining Speed And Accuracy: Quantization-aware LLM Pre-training "QAP

Combining Speed And Accuracy: Quantization-aware LLM Pre-training "QAP

Forget-Me-Not: A Proposal For A Simple Prompting Technique To Prevent Forgetting Information In Long Prompts

Forget-Me-Not: A Proposal For A Simple Prompting Technique To Prevent Forgetting Information In Long ...

Potential Of The Conversation Optimization Tokenizer: A Method To Improve LLM Inference Efficiency By 10%

Potential Of The Conversation Optimization Tokenizer: A Method To Improve LLM Inference Efficiency B ...

RoboTwin 2.0: Scalable Synthetic Data Generation And Benchmark Design For Dual-Arm Manipulation Robots

RoboTwin 2.0: Scalable Synthetic Data Generation And Benchmark Design For Dual-Arm Manipulation Robo ...

Enhanced LLM Code Generation With Property-based Testing! New Framework PGS To Break Self-Deception

Enhanced LLM Code Generation With Property-based Testing! New Framework PGS To Break Self-Deception

Evolution Of Llama To Support Reinforcement Learning, OctoThinker Shows The Power Of Intermediate Learning

Evolution Of Llama To Support Reinforcement Learning, OctoThinker Shows The Power Of Intermediate Le ...

Democratizing GPT-4o Level Image Generation: The Janus-4o And ShareGPT-4o-Image Challenge

Democratizing GPT-4o Level Image Generation: The Janus-4o And ShareGPT-4o-Image Challenge

FedNano: Lightweight And Efficient Distributed Learning Of Large-scale Multimodal Models

FedNano: Lightweight And Efficient Distributed Learning Of Large-scale Multimodal Models

ImmerseGen: Agent-guided, Lightweight X Highly Realistic Next-generation VR Scene Generation

ImmerseGen: Agent-guided, Lightweight X Highly Realistic Next-generation VR Scene Generation

PictSure: A New Method To Challenge Few-Shot Classification With The Power Of Visual Embedding

PictSure: A New Method To Challenge Few-Shot Classification With The Power Of Visual Embedding

Semantics-Oriented Reward Design With "PrefBERT," A New Evaluation Method To Evolve Long Sentence Generation

Semantics-Oriented Reward Design With "PrefBERT," A New Evaluation Method To Evolve Long Sentence Ge ...

Insight-V: A New Strategy For Multimodal Reasoning Connecting Vision And Thought

Insight-V: A New Strategy For Multimodal Reasoning Connecting Vision And Thought

Stable Flow: Visualization Of The "really Important Layers" Behind Image Generation

Stable Flow: Visualization Of The "really Important Layers" Behind Image Generation

Construction And Analysis Of The "TruthEval" Dataset To Expose LLM Weaknesses

Construction And Analysis Of The "TruthEval" Dataset To Expose LLM Weaknesses

31/01/2025 Large Language Models

MaskDiT: Low Learning Cost Diffusion Model For Image Generation

MaskDiT: Low Learning Cost Diffusion Model For Image Generation

27/01/2025 Image Generation

Limitations And Possibilities Of Large-Scale Language Models In Vietnamese High School Chemistry Exam Questions

Limitations And Possibilities Of Large-Scale Language Models In Vietnamese High School Chemistry Exa ...

08/01/2025 Large Language Models