How can I convert this type of dictionary into a dataframe?

Aizzaac · February 7, 2020, 7:44pm

Hi

I have used a Python SDK to extract data from Splunk.
In the image you can see an example of the output.

I want to send that information to a DataFrame.

Any clues will be helpful.

Thank yoi

tbeason · February 7, 2020, 7:58pm

Assuming that you have those in a vector Dv,

vcat(DataFrame.(Dv)...)

would work. You could also convert them to NamedTuples instead of Dicts and then construct the DataFrame.

Aizzaac · February 7, 2020, 8:21pm

I am iterating. This is the original code:

tbeason · February 7, 2020, 8:27pm

That doesn’t change anything. You can either collect the Dicts into a vector during your loop and use my suggestion afterwards, or you can create one-row DataFrames while you iterate (using df = DataFrame(item)) and collect them into a single one afterwards (again using vcat).

Aizzaac · February 7, 2020, 8:38pm

This is the second option you can create one-row DataFrames while you iterate:

It seems to be working.

But how can I see the dataframe?

tbeason · February 7, 2020, 8:52pm

That is not going to be doing what you want, you aren’t actually accumulating anything. Try this

Dv = Vector{Dict}()
for item in reader
    push!(Dv,item)
end
df=vcat(DataFrame.(Dv)...)

And, just taking a guess here,

df=vcat(DataFrame.(reader)...)

would probably work (since I see you are just printing with the loop anyway).

Aizzaac · February 7, 2020, 9:11pm

This is what I get

tbeason · February 7, 2020, 9:13pm

You need to using DataFrames

Aizzaac · February 7, 2020, 9:15pm

I think is the same problem I have with Python:

JULIA

Aizzaac · February 7, 2020, 10:14pm

Okey. It has worked in Python.
Now I will check Julia.

Aizzaac · February 9, 2020, 3:51pm

The iteration never accumulates anything.

pdeffebach · February 9, 2020, 4:28pm

Can you give a MWE with code written in backticks, as follows, rather than screenshots?

```
Your code here
```

It seems like reader might be empty. Having a MWE will help determine that that’s the problem.

Aizzaac · February 9, 2020, 7:13pm

This is the code. It is now working!!!
I used “collect”.

results=pyimport("splunklib.results")

kwargs_oneshot = (earliest_time= "2019-09-07T12:00:00.000-07:00",
              latest_time= "2019-09-09T12:00:00.000-07:00",
              count=0)

searchquery_oneshot = "search index=iis | lookup geo_BST_ONT longitude as sLongitude, latitude as sLatitude | stats count by featureId | geom geo_BST_ONT allFeatures=True | head 2" 

oneshotsearch_results = service.jobs.oneshot(searchquery_oneshot; kwargs_oneshot...)

# Get the results
reader = results.ResultsReader(oneshotsearch_results)

# collect them into an array
Dv = collect(reader)

using DataFrames
df=vcat(DataFrame.(Dv)...)

Topic		Replies	Views
Best practice for the conversion from a vector of dictionaries to a dataframe General Usage dictionary , dataframes	1	1433	September 16, 2021
Vcat multiple DataFrames General Usage	1	2148	May 19, 2021
Convert dictionary of dataframes into single dataframe General Usage	5	1220	March 30, 2021
DataFrame to Dict via Vector of Nested Named Tuples New to Julia jump , dataframes , namedtuple	2	534	November 28, 2021
[DataFrames Question]: How to convert single column with row of dictionary to multiple columns Specific Domains question , dataframes	4	520	May 14, 2022

How can I convert this type of dictionary into a dataframe?

Related topics