Reading CSV files is slow

Hi

I am loading a long list of text files into memory using DataFrames (joined on the Date column).

It is taking very long to read them in, so I tried multithreading, since Threads.nthreads() = 12 on my machine.

The multithreading only helps up to a point, though: 8000 files takes 10x longer than 4000 files, despite being only 40% larger in total size. The table below shows time taken versus number of files and total file size.

Any tips for improvement will be greatly appreciated!

Files    Size (MB)    Time (s)
1000     372          15
2000     644          28
4000     1000         127
8000     1400         1260
# single-threaded readfiles
function readfiles(files, filedirectory)
    # read the first file to seed the joined data frame
    df3 = DataFrame(CSV.File(filedirectory * files[1], threaded = true))
    rename!(df3, [Symbol("col$i") for i in 1:6])
    select!(df3, "col1", "col5")
    # name the value column after the file, dropping the ".txt" extension
    rename!(df3, "col5" => files[1][1:end-4])
    for file in files[2:end]
        df2 = DataFrame(CSV.File(filedirectory * file, threaded = true))
        rename!(df2, [Symbol("col$i") for i in 1:6])
        select!(df2, "col1", "col5")
        rename!(df2, "col5" => file[1:end-4])
        # each outerjoin widens df3 by one column
        df3 = outerjoin(df3, df2, on = "col1")
    end
    return df3
end
# multithreaded readfiles
function readfiles_par(filedirectory)
    files = readdir(filedirectory)
    n = Threads.nthreads()
    width = div(length(files), n)
    a = Array{Task}(undef, n)
    for j in 1:n
        start = (j - 1) * width + 1
        # the last chunk takes the remainder so no files are silently
        # dropped when length(files) is not divisible by n
        stop = j == n ? length(files) : start + width - 1
        a[j] = Threads.@spawn readfiles(files[start:stop], filedirectory)
    end
    results = fetch.(a)
    df = results[1]
    for j in 2:n
        df = outerjoin(df, results[j], on = "col1")
    end
    df.Date = Date.(df.col1, dateformat"mm/dd/yyyy")  # needs `using Dates`
    select!(df, Not("col1"))
    return df
end

Those superlinear timings indicate the problem I would expect from outerjoin. Have you verified that reading the CSV files is the bottleneck, and not the joining part?
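
One way to check (a rough, untested sketch that mirrors your preparation steps; filedirectory is whatever path you are using):

using CSV, DataFrames

files = readdir(filedirectory)

# 1. time the reads and per-file preparation alone
dfs = @time map(files) do f
    df = DataFrame(CSV.File(filedirectory * f))
    rename!(df, [Symbol("col$i") for i in 1:6])
    select!(df, "col1", "col5")
    rename!(df, "col5" => f[1:end-4])
    df
end

# 2. time the joins alone
joined = @time reduce((df1, df2) -> outerjoin(df1, df2, on = "col1"), dfs)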

Not sure. I'm trying to get a baseline with a one-off read of all the files, but hit an error:

ERROR: MethodError: no method matching joinpath(::Array{String,1})

fileDirectory = "d:/Data/Test/"
using Glob
files = glob("*.txt", fileDirectory)
df3 = DataFrame(CSV.File(files))

You need to broadcast over the vector of files. Try this (untested) code instead:

fileDirectory = "d:/Data/Test/"
using Glob
files = glob("*.txt", fileDirectory)
df3 = DataFrame.(CSV.File.(files))

Unfortunately the files are not all the same length:
DimensionMismatch("column :x1 has length 14735 and column :x2 has length 14734")

Can you import just one file using CSV.jl? It's best to break your problem into smaller steps to isolate the issue.
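
For instance, a minimal check (using the fileDirectory and glob pattern from your snippet above):

using CSV, DataFrames, Glob
fileDirectory = "d:/Data/Test/"
files = glob("*.txt", fileDirectory)   # glob returns full paths
df = DataFrame(CSV.File(files[1]))     # parse a single file first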


Like the other posters, I also doubt that the CSV reading itself is the slow part; more likely it is the manipulation and joining of the DataFrames.

Here is one other thing you could try, which may or may not be faster: use pmap to read ALL of the CSVs and do your small bit of preparation on each, but do not join them into one.

using Distributed
addprocs(12)
# packages and the worker function must be defined on every process
@everywhere using DataFrames, CSV
@everywhere function readandprep(file)
    df2 = DataFrame(CSV.File(file))
    rename!(df2, [Symbol("col$i") for i in 1:6])
    select!(df2, "col1", "col5")
    # name the value column after the file, dropping the extension
    rename!(df2, "col5" => file[1:end-4])
    return df2
end

all_df = pmap(readandprep, listofcsvfiles)

That should (I think?) net you a vector of DataFrames that are ready to be smooshed together however you like. I’d guess that the pmap should not take excessively long compared to the join, but who knows.
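
If you would rather stay with threads (as in your original code) instead of worker processes, an equivalent sketch is:

# threads-based alternative to pmap; assumes readandprep is defined as above
tasks = [Threads.@spawn(readandprep(f)) for f in listofcsvfiles]
all_df = fetch.(tasks)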

What is likely to be the bottleneck is the repeated outerjoin inside the loop:

df3 = outerjoin(df3, df2, on = "col1")

(note: you do not need the : before "col1")

Thanks, will give that a shot. It seems join only accepts two arguments, so I can't join 8000 files in one call. How would you outerjoin the list of files on a column like Date?

In a loop just like you are doing now, but in this case the data preparation is already done.
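
For example (a sketch; join_all is a hypothetical helper, and all_df is the vector from the pmap snippet above):

# assumes DataFrames is loaded and the files were prepared by readandprep
function join_all(dfs)
    df = dfs[1]
    for d in dfs[2:end]
        df = outerjoin(df, d, on = "col1")
    end
    return df
end

df = join_all(all_df)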

Are you sure you need a join, and not just a vcat of the data frames?
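
For example (untested), if the preparation kept a fixed value-column name and recorded the source file in its own column, a single vcat plus unstack would give a wide table without thousands of joins. Here readlong is a hypothetical variant of the readandprep function above, and filedirectory/files are the names from your code:

using CSV, DataFrames

function readlong(file, filedirectory)
    df = DataFrame(CSV.File(filedirectory * file))
    rename!(df, [Symbol("col$i") for i in 1:6])
    select!(df, "col1", "col5")
    df[!, :source] .= file[1:end-4]   # record which file each row came from
    return df
end

long = reduce(vcat, [readlong(f, filedirectory) for f in files])
# assumes one value per (col1, file) pair
wide = unstack(long, "col1", "source", "col5")   # one column per file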

I know I’m late to the party, but I believe CSV.jl uses threading out of the box when reading files above a certain size (at least I found a bug in error reporting that was related to that), so trying to thread on top of that might not work as expected.
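
If that interaction is the issue, it may be worth keeping each per-file read single-threaded when you are already spawning your own tasks, using the same threaded keyword as in the code above (filedirectory and file as in the original functions):

df2 = DataFrame(CSV.File(filedirectory * file, threaded = false))  # one thread per file read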

The output data frame is [Date, file1, file2, file3, …], joined on Date, so unfortunately I have to join at some stage.

If so, you can do something like this

julia> dfs = [DataFrame(a = rand(1:100, 2), b = rand(2)) for i in 1:100]

julia> function myjoin(df1, df2)
           outerjoin(df1, df2, on = "a", makeunique = true)
       end

julia> reduce(myjoin, dfs);