|
You are here |
gowers.wordpress.com | ||
| | | | |
francisbach.com
|
|
| | | | | [AI summary] This text discusses the scaling laws of optimization in machine learning, focusing on asymptotic expansions for both strongly convex and non-strongly convex cases. It covers the derivation of performance bounds using techniques like Laplace's method and the behavior of random minimizers. The text also explains the 'weird' behavior observed in certain plots, where non-strongly convex bounds become tight under specific conditions. The analysis connects theoretical results to practical considerations in optimization algorithms. | |
| | | | |
www.jeremykun.com
|
|
| | | | | Machine learning is broadly split into two camps, statistical learning and non-statistical learning. The latter we've started to get a good picture of on this blog; we approached Perceptrons, decision trees, and neural networks from a non-statistical perspective. And generally "statistical" learning is just that, a perspective. Data is phrased in terms of independent and dependent variables, and statistical techniques are leveraged against the data. In this post we'll focus on the simplest example of thi... | |
| | | | |
peterbloem.nl
|
|
| | | | | [AI summary] The pseudo-inverse is a powerful tool for solving matrix equations, especially when the inverse does not exist. It provides exact solutions when they exist and least squares solutions otherwise. If multiple solutions exist, it selects the one with the smallest norm. The pseudo-inverse can be computed using the singular value decomposition (SVD), which is numerically stable and handles cases where the matrix does not have full column rank. The SVD approach involves computing the SVD of the matrix, inverting the non-zero singular values, and then reconstructing the pseudo-inverse using the modified SVD components. This method is preferred due to its stability and ability to handle noisy data effectively. | |
| | | | |
statsandr.com
|
|
| | | Learn how to run multiple and simple linear regression in R, how to interpret the results and how to verify the conditions of application | ||