kawine.github.io
jxmo.io
A primer on variational autoencoders (VAEs) culminating in a PyTorch implementation of a VAE with discrete latents.
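For orientation, here is a minimal sketch of what a VAE with discrete latents can look like in PyTorch, not the linked post's actual code: a categorical latent code sampled with the Gumbel-Softmax relaxation so the sampling step stays differentiable. All module names and dimensions are illustrative.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class DiscreteVAE(nn.Module):
    """Toy VAE with num_latents categorical latent variables (illustrative only)."""
    def __init__(self, input_dim=784, hidden_dim=256, num_latents=20, num_classes=10):
        super().__init__()
        self.num_latents = num_latents    # number of categorical latent variables
        self.num_classes = num_classes    # categories per latent variable
        self.encoder = nn.Sequential(
            nn.Linear(input_dim, hidden_dim), nn.ReLU(),
            nn.Linear(hidden_dim, num_latents * num_classes),
        )
        self.decoder = nn.Sequential(
            nn.Linear(num_latents * num_classes, hidden_dim), nn.ReLU(),
            nn.Linear(hidden_dim, input_dim),
        )

    def forward(self, x, tau=1.0):
        logits = self.encoder(x).view(-1, self.num_latents, self.num_classes)
        # Differentiable (relaxed) sample from the categorical posterior.
        z = F.gumbel_softmax(logits, tau=tau, hard=False)
        recon_logits = self.decoder(z.view(x.size(0), -1))
        return recon_logits, logits

def loss_fn(x, recon_logits, logits):
    # Reconstruction term: Bernoulli likelihood over pixels.
    recon = F.binary_cross_entropy_with_logits(recon_logits, x, reduction="sum")
    # KL between the categorical posterior q(z|x) and a uniform prior over classes.
    q = F.softmax(logits, dim=-1)
    log_q = F.log_softmax(logits, dim=-1)
    kl = (q * (log_q + torch.log(torch.tensor(float(logits.size(-1)))))).sum()
    return (recon + kl) / x.size(0)

# Usage: one gradient step on a random batch standing in for flattened 28x28 images.
model = DiscreteVAE()
x = torch.rand(32, 784)
recon_logits, logits = model(x, tau=0.7)
loss = loss_fn(x, recon_logits, logits)
loss.backward()
```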
windowsontheory.org
Previous post: ML theory with bad drawings. Next post: What do neural networks learn and when do they learn it; see also all seminar posts and the course webpage. Lecture video (starts at slide 2 since I hit the record button 30 seconds too late - sorry!) - slides (PDF) - slides (PowerPoint with ink and animation)...
thenumb.at
pytorch.org
Large Language Models (LLMs) are typically very resource-intensive, requiring significant amounts of memory, compute, and power to operate effectively. Quantization provides a solution by reducing weights and activations from 16-bit floats to lower bit widths (e.g., 8-bit, 4-bit, 2-bit), achieving significant speedups and memory savings while also enabling larger batch sizes.
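To make the idea concrete, here is a minimal sketch of per-tensor symmetric int8 weight quantization, written from scratch rather than using PyTorch's quantization APIs; the function names and the toy tensor are illustrative.

```python
import torch

def quantize_int8(w: torch.Tensor):
    # Symmetric per-tensor scale: map the largest-magnitude weight to 127.
    scale = w.abs().max() / 127.0
    q = torch.clamp(torch.round(w / scale), -128, 127).to(torch.int8)
    return q, scale

def dequantize(q: torch.Tensor, scale: torch.Tensor):
    # Recover approximate float weights from the int8 codes.
    return q.to(torch.float32) * scale

w = torch.randn(4096, 4096)          # stand-in for one fp16/fp32 weight matrix
q, scale = quantize_int8(w)
w_hat = dequantize(q, scale)

print("int8 storage (bytes):", q.numel())            # 1 byte/element vs. 2 for fp16
print("mean abs error:", (w - w_hat).abs().mean().item())
```

The halving (or better) of bytes per weight is where the memory savings and larger-batch headroom come from; real libraries add per-channel or per-group scales and lower bit widths on top of this basic recipe.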