How can I use TileDB from Julia?

Juan · December 20, 2018, 7:07pm

How can I use TileDB from Julia? Is there any port? I’ve searched on Julia observer but haven’t find anything?
I’ve also searched on JuliaDB docs but haven’t found anything.

I’m looking for a fast database to deal with large datasets and be able to add new columns on disk. My candidates are hdf5, SciDB, MonetDB, Rasdaman and TileDB, the latter is supposed to be faster and optimized to work with matrices.

cstjean · December 20, 2018, 9:32pm

The Julia API is apparently planned. You could of course call it via PyCall, but it might not have such great performance.

Juan · December 20, 2018, 11:47pm

Do you have experience with TileDB?
Do you recommend it over other options?

ImreSamu · December 21, 2018, 3:02pm

imho: Maybe you can ask this question here : https://forum.tiledb.io/

ihnorton · November 18, 2019, 4:03pm

Hi folks, in order to gauge interest, I just created a votable feature request for Julia bindings for TileDB:

If you are interested, please give an upvote at the link above. We would also be happy for any comments if you have a specific use-case in mind (or send us a note at hello@tiledb.com).

As background, TileDB is a chunked array format + storage manager supporting N-dimensional dense and sparse arrays, with built-in support for AWS S3 and HDFS, encryption, and time-traveling, as well as integrations for Spark, Dask, GDAL, PDAL, VCF, and more. The core library provides a C API, and bindings are currently available for Python, R, Go, Java, and C++. Please see our docs for more information.

Thanks,
Isaiah (TileDB, Inc)

Juan · November 18, 2019, 5:16pm

Do you know how fast is TileDB compared to Clickhouse?

ihnorton · November 18, 2019, 9:16pm

Hi @Juan,

The core TileDB library is a storage engine that does not implement SQL queries. Rather, it may be used as a columnar store by SQL engines (currently PrestoDB, MariaDB and SparkSQL). The integrations we have built re-use the existing engine’s SQL capability, but push compute to storage whenever possible (currently ranges, in the future more complex processing). We have been discussing adding TileDB as a data connector to Clickhouse, which would give Clickhouse optimized AWS S3 data access. Users would also enjoy direct data access outside of Clickhouse, e.g., for running Python/R/C++/etc. code directly against the data store.

Thanks,
Isaiah

Topic		Replies	Views
Any fast database working with Julia? Data	3	954	November 25, 2018
ANN: JuliaDB.jl Community	40	9707	November 13, 2018
ClickHouse database and Julia Internals & Design question	4	1759	June 5, 2019
Package for reading/writing ~100GB data files General Usage	10	2883	November 17, 2018
Arrow, Feather, and Parquet Data parquet , arrow	48	12950	November 1, 2020

How can I use TileDB from Julia?

Related topics