Researcher, this channel is just my reading list.
Currently, my research focuses on RNNs in large language models (LLMs). I’m interested in exploring how to adapt transformer-based models, like the 671B R1, to use RNN attention. You can check out my ongoing work on ARWKV here.
huggingface.co/papers/2501.15570
X: x.com/xiaolGo
Papers: scholar.google.com/citations?user=TPJYxnkAAAAJ
Shared 22 hours ago
54 views
Shared 3 days ago
70 views
Shared 4 days ago
69 views
Shared 5 days ago
399 views
Shared 1 week ago
283 views
Radiology's Last Exam (RadLE):Benchmarking Frontier Multimodal AI Against Human Experts in Radiology
Shared 1 week ago
37 views
Shared 2 weeks ago
157 views
Shared 2 weeks ago
52 views
Shared 2 weeks ago
54 views
Shared 2 weeks ago
103 views
Shared 3 weeks ago
56 views
Shared 3 weeks ago
492 views
Shared 3 weeks ago
219 views
Shared 3 weeks ago
101 views
Shared 4 weeks ago
174 views
Shared 4 weeks ago
82 views
Shared 1 month ago
3.7K views
Shared 1 month ago
144 views
Shared 1 month ago
3.3K views
Shared 1 month ago
637 views
Shared 1 month ago
51 views
Shared 1 month ago
33 views
Shared 1 month ago
46 views
Shared 1 month ago
44 views
Shared 1 month ago
124 views
Shared 1 month ago
51 views
Shared 1 month ago
41 views
Shared 1 month ago
588 views
Shared 1 month ago
122 views
Shared 2 months ago
157 views
Shared 2 months ago
135 views
Shared 2 months ago
119 views