Researcher. This channel is just my reading list.
Currently, my research focuses on RNNs in large language models (LLMs). I'm interested in exploring how to adapt transformer-based models, such as the 671B R1, to use RNN attention. You can check out my ongoing work on ARWKV here:
huggingface.co/papers/2501.15570

X: x.com/xiaolGo
Papers: scholar.google.com/citations?user=TPJYxnkAAAAJ
