Tunadorable

fixing flash-attention kernel Tunadorable 54 6 часов назад
LayerNorm | Triton GPU Kernels 101 Lesson #8 Tunadorable 113 3 дня назад
Matmul | Triton GPU Kernels 101 Lesson #6 Tunadorable 164 5 дней назад
Dropout | Triton GPU Kernels 101 Lesson #7 Tunadorable 81 5 дней назад
Fused Softmax | Triton GPU Kernels 101 Lesson #5 Tunadorable 79 5 дней назад
Vector addition | Triton GPU Kernels 101 Lesson #4 Tunadorable 135 5 дней назад
Have we been doing LLM inference wrong the whole time?!?! Tunadorable 6,258 1 месяц назад
Normalizing GPT on the unit-hypersphere (WITH CODE) Tunadorable 4,420 3 месяца назад
The Structured Task Hypothesis Tunadorable 1,826 7 месяцев назад
Why Does Diffusion Work Better than Auto-Regression? Algorithmic Simplicity 485,400 1 год назад
Does AI have any chance predicting chaotic systems? Tunadorable 4,133 5 месяцев назад
Conversational Swarm Intelligence with Dr. Louis Rosenberg Tunadorable 1,045 8 дней назад
SORA DÉTRÔNÉ ? Le CHOC des IA vidéo que personne n'attendait ! Johan : Solutions Digitales 3,441 1 день назад
Claude Almost Bankrupt Me... Theo - t3․gg 91,507 55 лет назад
Do we really need to use every single transformer layer? Tunadorable 2,290 5 месяцев назад
Sigma-GPTs: A New Approach to Autoregressive Models Tunadorable 2,888 7 месяцев назад
Accelerated Training by Amplifying Slow Gradients Tunadorable 32,244 8 месяцев назад
Are our perceptual systems structured to view the world truthfully? Tunadorable 2,766 6 месяцев назад
Curating December's best new AI papers from ArXiv Tunadorable 1,734 2 месяца назад
Rambling about GPT architecture edit ideas Tunadorable 1,460 5 месяцев назад
How to make neural networks better at learning new things Tunadorable 1,943 55 лет назад
What would it mean for an AI to "understand"? Tunadorable 3,749 6 месяцев назад
Let this method tune hyper-parameters for you! Tunadorable 2,122 1 месяц назад
A new way to compare high dimensional vectors Tunadorable 11,399 6 месяцев назад
Creating new tokens out of internal representations Tunadorable 2,685 2 месяца назад
Skimming hella AI paper abstracts - Nov 5, 2024 Tunadorable 1,607 3 месяца назад
channel update Tunadorable 1,644 55 лет назад
Models inside models inside models Tunadorable 2,347 5 месяцев назад
Some training tokens are more valuable than others Tunadorable 1,396 6 месяцев назад
Theoretical physics of next token prediction in LLMs Tunadorable 2,127 3 месяца назад
Skimming hella new AI paper abstracts - January 2025 Tunadorable 1,882 2 месяца назад