|
You are here |
tsak.dev | ||
| | | | |
sixcolors.com
|
|
| | | | | Six Colors by Jason Snell, Dan Moren and friends | |
| | | | |
www.andrlik.org
|
|
| | | | | It is now clear that at least some AI companies are ignoring robots.txt that forbid them from scraping a site. Robb Knight wrote up a great guide for explicitly blocking those scraping bots via your Nginx config. However, this site is currently served by AWS CloudFront, which means that the content gets served without the request touching the source server. I was sure there had to be a way to do something similar with a CloudFront function, so I set out to try. | |
| | | | |
coryd.dev
|
|
| | | | | AI companies are crawling the open web to, ostensibly, improve the quality of their models and products. This process is extractive and accrues the benefit to said companies, not the owners of sites both small and large. | |
| | | | |
www.techradar.com
|
|
| | | ChatGPT is always changing | ||