Tunadorable

fixing flash-attention kernel

Tunadorable · 16 · 54 · 6 hours ago

Building GPU kernel tutorials for dropout & layernorm - live 2025.2.21

Tunadorable · 11 · 38 · 7 hours ago

LayerNorm | Triton GPU Kernels 101 Lesson #8

Tunadorable · 34 · 113 · 3 days ago

Matmul | Triton GPU Kernels 101 Lesson #6

Tunadorable · 49 · 164 · 5 days ago

Dropout | Triton GPU Kernels 101 Lesson #7

Tunadorable · 24 · 81 · 5 days ago

Fused Softmax | Triton GPU Kernels 101 Lesson #5

Tunadorable · 24 · 79 · 5 days ago

Vector addition | Triton GPU Kernels 101 Lesson #4

Tunadorable · 41 · 135 · 5 days ago

How to use a cloud GPU | Triton GPU Kernels 101 Lesson #3

Tunadorable · 59 · 198 · 5 days ago

GPU Architecture Basics | Triton GPU Kernels 101 Lesson #2

Tunadorable · 55 · 183 · 5 days ago

Have we been doing LLM inference wrong the whole time?!?!

Tunadorable · 2K · 6,258 · 1 month ago

Normalizing GPT on the unit-hypersphere (WITH CODE)

Tunadorable · 1K · 4,420 · 3 months ago

The Structured Task Hypothesis

Tunadorable · 548 · 1,826 · 7 months ago

Why Does Diffusion Work Better than Auto-Regression?

Algorithmic Simplicity · 146K · 485,400 · 1 year ago

Does AI have any chance predicting chaotic systems?

Tunadorable · 1K · 4,133 · 5 months ago

Conversational Swarm Intelligence with Dr. Louis Rosenberg

Tunadorable · 314 · 1,045 · 8 days ago

purposely "pre-caching" features or inadvertently leaving "breadcrumbs" for future timesteps?

Tunadorable · 698 · 2,328 · 6 months ago

SORA DÉTRÔNÉ ? Le CHOC des IA vidéo que personne n'attendait !

Johan : Solutions Digitales · 1K · 3,441 · 1 day ago

Claude Almost Bankrupt Me...

Theo - t3․gg · 27K · 91,507 · 55 years ago

Do we really need to use every single transformer layer?

Tunadorable · 687 · 2,290 · 5 months ago

Sigma-GPTs: A New Approach to Autoregressive Models

Tunadorable · 866 · 2,888 · 7 months ago

Accelerated Training by Amplifying Slow Gradients

Tunadorable · 10K · 32,244 · 8 months ago

Are our perceptual systems structured to view the world truthfully?

Tunadorable · 830 · 2,766 · 6 months ago

Curating December's best new AI papers from ArXiv

Tunadorable · 520 · 1,734 · 2 months ago

Rambling about GPT architecture edit ideas

Tunadorable · 438 · 1,460 · 5 months ago

How to make neural networks better at learning new things

Tunadorable · 583 · 1,943 · 55 years ago

What would it mean for an AI to "understand"?

Tunadorable · 1K · 3,749 · 6 months ago

Let this method tune hyper-parameters for you!

Tunadorable · 637 · 2,122 · 1 month ago

A new way to compare high dimensional vectors

Tunadorable · 3K · 11,399 · 6 months ago

Creating new tokens out of internal representations

Tunadorable · 806 · 2,685 · 2 months ago

Skimming hella AI paper abstracts - Nov 5, 2024

Tunadorable · 482 · 1,607 · 3 months ago

channel update

Tunadorable · 493 · 1,644 · 55 years ago

Models inside models inside models

Tunadorable · 704 · 2,347 · 5 months ago

Some training tokens are more valuable than others

Tunadorable · 419 · 1,396 · 6 months ago

Theoretical physics of next token prediction in LLMs

Tunadorable · 638 · 2,127 · 3 months ago

How diffusion modeling can revolutionize evolutionary algorithms

Tunadorable · 2K · 5,673 · 1 month ago

Skimming hella new AI paper abstracts - January 2025

Tunadorable · 565 · 1,882 · 2 months ago
