For python tools:
Look at Quilt and frictionlessdata (actually for most languages including julia)
IIRC both are commercial concerns.
Niether can consume JSON-LD directly AFAICK
but frictionlessdata has their own competing standard
They seem like good folk.
I suspect that one would benifit from a bit of generalizing:
function get_dataurls_from_webserver_index(datapage_url)
datapage = getpage(datapage_url)
data_hrefs = (attr(ele, "href")
for ele in eachmatch(sel"a", datapage.root))
if !match(r"^(To )?Parent( Directory)?$"i, text_only(ele)) && !match(r"(To )?Index"i, text_only(ele)
)
data_urls = joinpath.(datapage_url, data_hrefs)
end
Ok, cool! Put this in the misc_extractors file and reloaded kernel. (added as a ‘v2’ function, and had to add an end).
Tried to give it a spin but am getting an error: