Explore >> Select a destination


You are here

comsci.blog
| | www.nicktasios.nl
0.9 parsecs away

Travel
| | In the Latent Diffusion Series of blog posts, I'm going through all components needed to train a latent diffusion model to generate random digits from the MNIST dataset. In this first post, we will tr
| | sebastianraschka.com
3.0 parsecs away

Travel
| | Previously, I shared an article using multi-GPU training strategies to speed up the finetuning of large language models. Several of these strategies include...
| | www.paepper.com
0.9 parsecs away

Travel
| | When you have a big data set and a complicated machine learning problem, chances are that training your model takes a couple of days even on a modern GPU. However, it is well-known that the cycle of having a new idea, implementing it and then verifying it should be as quick as possible. This is to ensure that you can efficiently test out new ideas. If you need to wait for a whole week for your training run, this becomes very inefficient.
| | blog.ephorie.de
11.7 parsecs away

Travel
| [AI summary] The blog post explores the connection between logistic regression and neural networks, demonstrating how logistic regression can be viewed as the simplest form of a neural network through mathematical equivalence and practical examples.