Scraping an HTML table from a website

I am trying to obtain COVID-19 data from a website. The site has the data I want, but only as an HTML table.

I am looking for Julia tools to scrape the information from the HTML table, something like this:

https://towardsdatascience.com/web-scraping-html-tables-with-python-c9baba21059

Is there an existing package in Julia for this?

I am thinking about writing my own module for this, but wanted to ask here first.


Yes, the combination of Gumbo and Cascadia is what you need: Gumbo parses the HTML and Cascadia selects elements from the parsed tree with CSS selectors. There is some example code in the README for Cascadia, and some more examples in this Stack Overflow post: https://stackoverflow.com/questions/42915962/extracting-and-constructing-tables-from-html-files-using-julia
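
As a rough illustration of that combination, here is a minimal sketch. The HTML string, the `cases` table id, and the `celltext` helper are all made up for the example; for a real page you would first download the HTML (for instance with HTTP.jl) and adjust the selectors to match its markup.

```julia
using Gumbo, Cascadia

# Hypothetical sample page standing in for the downloaded HTML.
html = """
<html><body>
<table id="cases">
  <tr><th>Country</th><th>Cases</th></tr>
  <tr><td>A</td><td>10</td></tr>
  <tr><td>B</td><td>25</td></tr>
</table>
</body></html>
"""

# Gumbo parses the string into a tree; Cascadia queries it with CSS selectors.
doc = parsehtml(html)
table = first(eachmatch(sel"table#cases", doc.root))

# Illustrative helper: the text of a cell with surrounding whitespace removed.
celltext(node) = String(strip(nodeText(node)))

# Header cells are <th>; data cells are <td>, collected row by row.
header = [celltext(th) for th in eachmatch(sel"th", table)]
rows = [[celltext(td) for td in eachmatch(sel"td", tr)]
        for tr in eachmatch(sel"tr", table)]

# The header row contains no <td> cells, so drop the empty entry it produces.
rows = filter(!isempty, rows)

println(header)
foreach(println, rows)
```

From here the `header` and `rows` vectors can be turned into whatever structure you prefer, for example a DataFrame, after parsing the numeric columns.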
