Pluging: Julia | Dataiku

Hi, I had installed the pluging in DataIku to create and execute Julia recipes the same way you would use any other code recipes. It makes that a Julia kernel also becomes available for Jupyter notebooks.

I am using the Dataiku dataset reading package, where it doesn’t seem to recognize entered datasets. The code used is:

using Dataiku, DataFrames
import Dataiku: get_dataframe
df = get_dataframe(dataset"PROJECTKEY.myDataset")

It ran into this error:

UndefVarError: isnothing not defined

Stacktrace:
[1] get_context() at /opt/dss/data_dir/code-envs/julia/packages/Dataiku/lgsdz/src/request.jl:60
[2] #get_url_and_header#11(::Bool, ::String, ::Nothing, ::Bool, ::Function, ::String) at /opt/dss/data_dir/code-envs/julia/packages/Dataiku/lgsdz/src/request.jl:79
[3] #get_url_and_header at ./none:0 [inlined]
[4] #request#19(::Bool, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::Function, ::String, ::String, ::String) at /opt/dss/data_dir/code-envs/julia/packages/Dataiku/lgsdz/src/request.jl:133
[5] request at /opt/dss/data_dir/code-envs/julia/packages/Dataiku/lgsdz/src/request.jl:129 [inlined]
[6] #request_json#18(::Bool, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::Function, ::String, ::String, ::String) at /opt/dss/data_dir/code-envs/julia/packages/Dataiku/lgsdz/src/request.jl:117
[7] request_json at /opt/dss/data_dir/code-envs/julia/packages/Dataiku/lgsdz/src/request.jl:117 [inlined] (repeats 2 times)
[8] get_schema(::DSSDataset) at /opt/dss/data_dir/code-envs/julia/packages/Dataiku/lgsdz/src/Datasets.jl:548
[9] #_get_reading_schema#28(::Bool, ::Function, ::DSSDataset, ::Array{Any,1}) at /opt/dss/data_dir/code-envs/julia/packages/Dataiku/lgsdz/src/Datasets.jl:102
[10] #_get_reading_schema at ./none:0 [inlined]
[11] #get_dataframe#27(::Bool, ::Array{String,1}, ::Array{String,1}, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::Function, ::DSSDataset, ::Array{Any,1}) at /opt/dss/data_dir/code-envs/julia/packages/Dataiku/lgsdz/src/Datasets.jl:82
[12] get_dataframe at /opt/dss/data_dir/code-envs/julia/packages/Dataiku/lgsdz/src/Datasets.jl:82 [inlined] (repeats 2 times)
[13] top-level scope at In[1]:3

Can you please let me know on how I can overcome this issue ?

Thank you in advance for your return,

Best regards.

What version of Julia are you running? (you can call versioninfo() to know)

FWIW I’m getting a different error on Julia 1.7.2

But in any case that package hasn’t been updated in over a year, it’s using pretty ancient versions of its dependencies (e.g. CSV 0.5 when current release is 0.10) and tests fail as well. Probably worth opening an issue to see if it’s still maintained.

Julia Version 1.0.4
Commit 38e9fb7f80 (2019-05-16 03:38 UTC)
Platform Info:
OS: Linux (i686-pc-linux-gnu)
CPU: Intel(R) Xeon(R) CPU @ 2.30GHz
WORD_SIZE: 32
LIBM: libopenlibm
LLVM: libLLVM-6.0.0 (ORCJIT, haswell)
Environment:
DKUJULIADEPOT = /opt/dss/data_dir/code-envs/julia
JULIA_DEPOT_PATH = /opt/dss/data_dir/code-envs/julia
DKUJULIABIN = /var/lib/snapd/snap/bin/julia

Ok, isnothing is not defined there, you need a more recent version, 1.6 would be a good choice.

1 Like

I would say 1.7.2 would be an excellent choice given that it is the current stable release that almost everyone should be using?

4 Likes