Explore >> Select a destination


You are here

ssc.io
| | hyperwriteai.com
7.5 parsecs away

Travel
| | HyperWrite's custom Llama 3 70B model offers superior writing quality and the ability to access real-time information, outperforming other AI language models.
| | bartoszmilewski.com
8.2 parsecs away

Travel
| | There are many excellent AI papers and tutorials that explain the attention pattern in Large Language Models. But this essentially simple pattern is often obscured by implementation details and optimizations. In this post I will try to cut to the essentials. In a nutshell, the attention machinery tries to get at a meaning of a...
| | deepmind.google
5.5 parsecs away

Travel
| | We ask the question: "What is the optimal model size and number of training tokens for a given compute budget?" To answer this question, we train models of various sizes and with various numbers...
| | ronnascakeblog.wordpress.com
32.7 parsecs away

Travel
| You never know what someone is going to ask you to make for them. A toilet paper roll and a dozen poop emoji cupcakes. Sure, why not?This tongue-in-cheek cake was so fun to make. A vanilla toilet paper roll, surrounded by 12 chocolate (natch) poop emoji cupcakes was a gift from a husband to his...