Scrape a table from the NASA GCN circulars website

raman_kumar · July 28, 2023, 4:38pm

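using HTTP, CSV, DataFrames

function doanalysis()
    dfg = nothing
    grb_re = r"GRB ?\d{6}([A-G]|(\.\d{2}))?"
    for x in 21200   # a single circular; use a range such as 21200:21210 to scan several
        print("\r peeking at GCN $x ")
        try
            url = "https://gcn.nasa.gov/circulars/$x/raw"
            # status_exception=false returns 404 responses instead of throwing StatusError
            resp = HTTP.get(url; status_exception=false)
            status = resp.status
            print(" ", status, " ")
            # skip missing circulars (404) and any other failed request
            if status != 200
                println("status=", status)
                continue
            end
            txt = String(resp.body)
            # GRB name, or missing if the circular does not mention one
            m = match(grb_re, txt)
            grb = isnothing(m) ? missing : m.match
            print(grb)

            if occursin("GROND observations", txt)
                println(" GROND report")
                # the table starts at the first line that begins with a band name ...
                he = first(findfirst(r"^(g'|r'|i'|z'|J|H|K)"m, txt))
                # ... and ends at the next blank line
                lr = first(findnext(r"^(?:[\t ]*(?:\r?\n|\r))+"m, txt, he))
                # turn " =", " >" and "+/-" into a single column delimiter
                cltxt = replace(txt[he:lr], r" ?(=|>)" => "|", "+/-" => "|")
                df = CSV.read(IOBuffer(cltxt), DataFrame, delim="|", header=0)
                df.GCN = fill(x, nrow(df))
                df.GRB = fill(grb, nrow(df))
                # cols=:union tolerates circulars whose tables have different column counts
                dfg = isnothing(dfg) ? df : vcat(dfg, df; cols=:union)
                @show dfg
            end
        catch e
            println("error: ", e)   # report what failed instead of hiding it
        end
    end
    return dfg
end
doanalysis()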
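
In case it helps to see what the replace step does without hitting the website, here is a minimal offline sketch using only Base Julia. The sample text is hypothetical (the band values are made up, only the layout imitates a GROND table), and the name `rows` is just for illustration; it is not part of the code above.

```julia
# Hypothetical sample in the style of a GROND magnitude table; the numbers are made up.
sample = """
g' = 21.3 +/- 0.1
r' = 20.9 +/- 0.1
J > 20.5
"""

# Same substitutions as in doanalysis(): " =", " >" and "+/-" all become "|".
# Passing several pattern => replacement pairs at once needs Julia 1.7 or newer.
cleaned = replace(sample, r" ?(=|>)" => "|", "+/-" => "|")

# Split each non-empty line on "|" and trim the whitespace around every field.
rows = [strip.(split(line, "|")) for line in split(cleaned, "\n") if !isempty(strip(line))]

for r in rows
    println(r)
end
```

Note that after this substitution a measurement line ends up with three fields and an upper-limit line with only two, and the distinction between `=` and `>` is lost. If that matters, the sign would have to be kept in a column of its own.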

Running doanalysis() gives the output shown below for GCN 21200.
