Explore >> Select a destination


You are here

stribny.name
| | srcco.de
8.2 parsecs away

Travel
| | Scraping HTML websites for information is a common task. This blog post shows how to extract information via the Beautiful Soup (bs4) Python library. Some wanted information is buried in semi-structu
| | www.gregreda.com
12.3 parsecs away

Travel
| | A beginner's guide to getting started with web scraping using Python and BeautifulSoup.
| | www.paepper.com
11.9 parsecs away

Travel
| | If you want to learn how to create embeddings of your website and how to use a question answering bot to answer questions which are covered by your website, then you are in the right spot. The Github repository which contains all the code of this blog entry can be found here. It was trending on Hacker news on March 22nd and you can check out the disccussion here. We will approach this goal as follows:
| | stribny.name
10.1 parsecs away

Travel
| [AI summary] The article explains how to extract plain text from an HTML page using Python with the BeautifulSoup library and lxml parser, providing a function that recursively gathers text from specified block elements.