Thanks, this is helpful! My Excel file had 3 sheets, so I had to save each one as a separate CSV file before trying your suggestion. Unfortunately, I get a weird error:
using HTTP, CSV
data_url = "https://github.com/MichaelBarmann/ParamEst_MRE/blob/main/sample_data.csv"
my_file = CSV.File(HTTP.get(data_url).body; header = 1)
┌ Warning: thread = 1 warning: parsed expected 1 columns, but didn't reach end of line around data row: 53. Ignoring any extra columns on this row
└ @ CSV C:\Users\Michael\.julia\packages\CSV\CJfFO\src\file.jl:606
┌ Warning: thread = 1 warning: parsed expected 1 columns, but didn't reach end of line around data row: 57. Ignoring any extra columns on this row
└ @ CSV C:\Users\Michael\.julia\packages\CSV\CJfFO\src\file.jl:606
┌ Warning: thread = 1 warning: parsed expected 1 columns, but didn't reach end of line around data row: 149. Ignoring any extra columns on this row
└ @ CSV C:\Users\Michael\.julia\packages\CSV\CJfFO\src\file.jl:606
┌ Warning: thread = 1 warning: parsed expected 1 columns, but didn't reach end of line around data row: 163. Ignoring any extra columns on this row
└ @ CSV C:\Users\Michael\.julia\packages\CSV\CJfFO\src\file.jl:606
┌ Warning: thread = 1 warning: parsed expected 1 columns, but didn't reach end of line around data row: 178. Ignoring any extra columns on this row
└ @ CSV C:\Users\Michael\.julia\packages\CSV\CJfFO\src\file.jl:606
┌ Warning: thread = 1 warning: parsed expected 1 columns, but didn't reach end of line around data row: 179. Ignoring any extra columns on this row
└ @ CSV C:\Users\Michael\.julia\packages\CSV\CJfFO\src\file.jl:606
178-element CSV.File{false}:
CSV.Row: (<!DOCTYPE html> = "<html lang=\"en\" >",)
CSV.Row: (<!DOCTYPE html> = " <head>",)
CSV.Row: (<!DOCTYPE html> = " <meta charset=\"utf-8\">",)
CSV.Row: (<!DOCTYPE html> = " <link rel=\"dns-prefetch\" href=\"https://github.githubassets.com\">",)
CSV.Row: (<!DOCTYPE html> = " <link rel=\"dns-prefetch\" href=\"https://avatars.githubusercontent.com\">",)
CSV.Row: (<!DOCTYPE html> = " <link rel=\"dns-prefetch\" href=\"https://github-cloud.s3.amazonaws.com\">",)
CSV.Row: (<!DOCTYPE html> = " <link rel=\"dns-prefetch\" href=\"https://user-images.githubusercontent.com/\">",)
CSV.Row: (<!DOCTYPE html> = " <link crossorigin=\"anonymous\" media=\"all\" integrity=\"sha512-+D6Fy+EweT791upBwWDwZQxdKdpY58/tEdY1xnOvq7urHPMZe2ZtMhRmzWUL6+Dknj2BwHdlVvxPHDB6ep7wQQ==\" rel=\"stylesheet\" href=\"https://github.githubassets.com/assets/frameworks-f83e85cbe130793efdd6ea41c160f065.css\" />",)
CSV.Row: (<!DOCTYPE html> = " <link crossorigin=\"anonymous\" media=\"all\" integrity=\"sha512-B+uagPXLI8XoStGPc9DrQ9xVWfBS0VvAKvYTQopaizdxo8T46TCtAVE9sY15FQ0kseA1LB3V5GKklDGuwjqCHg==\" rel=\"stylesheet\" href=\"https://github.githubassets.com/assets/site-07eb9a80f5cb23c5e84ad18f73d0eb43.css\" />",)
CSV.Row: (<!DOCTYPE html> = " <link crossorigin=\"anonymous\" media=\"all\" integrity=\"sha512-BgutnhwyiKQLFDdsLnN+C/pdoX66e9VdVPnjGR1KkI6lMDTbqn8SWQLjp3dqzfGblsErstNfjUq94M/kZ8/1qA==\" rel=\"stylesheet\" href=\"https://github.githubassets.com/assets/behaviors-060bad9e1c3288a40b14376c2e737e0b.css\" />",)
CSV.Row: (<!DOCTYPE html> = " ",)
CSV.Row: (<!DOCTYPE html> = " ",)
CSV.Row: (<!DOCTYPE html> = " ",)
⋮
CSV.Row: (<!DOCTYPE html> = "</div>",)
CSV.Row: (<!DOCTYPE html> = " </div>",)
CSV.Row: (<!DOCTYPE html> = "</div>",)
CSV.Row: (<!DOCTYPE html> = " </main>",)
CSV.Row: (<!DOCTYPE html> = " </div>",)
CSV.Row: (<!DOCTYPE html> = " </div>",)
CSV.Row: (<!DOCTYPE html> = " ",)
CSV.Row: (<!DOCTYPE html> = "<div class=\"footer container-xl width-full p-responsive\" role=\"contentinfo\">",)
CSV.Row: (<!DOCTYPE html> = " <div class=\"position-relative d-flex flex-row-reverse flex-lg-row flex-wrap flex-lg-nowrap flex-justify-center flex-lg-justify-between pt-6 pb-2 mt-6 f6 color-text-secondary border-top color-border-secondary \">",)
CSV.Row: (<!DOCTYPE html> = " <ul class=\"list-style-none d-flex flex-wrap col-12 col-lg-5 flex-justify-center flex-lg-justify-between mb-2 mb-lg-0\">",)
CSV.Row: (<!DOCTYPE html> = " <li class=\"mr-3 mr-lg-0\">© 2021 GitHub",)
CSV.Row: (<!DOCTYPE html> = " <li class=\"mr-3 mr-lg-0\"><a href=\"https://docs.github.com/en/github/site-policy/github-terms-of-service\" data-ga-click=\"Footer",)
Any idea what’s going wrong? Do I need to specify some additional keywords for it to properly parse the data? (I tried including header = 1 and specifying the column types, but it didn’t change anything…)