I am a beginner, so I am trying my best to describe my issue as clearly as possible.
I installed julia 1.2.0 in a windows 2012 with 2 CPU/socket (22 cores in each CPU) and 512GB RAM.
I am wondering whether I can start Julia workers to fully utilize the cores and config each worker with specific number of cores. I have the following code to start multiple workers
using Distributed
x= 44
addprocs(x+1-length(procs()))
then use @everywhere to load codes into the workers
then use remotecall to run the program in each worker
for i=1:nworkers() ; remotecall(func,workers()[i]) ; end
my observation was that if I do not use distributed, main process uses all my physical cores (50% of my logical cores, so OS task manager shows 50% usage), but when I use distributed, each worker only uses 1 core and there is no way to specify number of cores for each worker. it is very beneficial to me if I can do that since my worker processes different data so resource requirement is different.
@indycdrom - I think you are describing the difference between (I am oversimplifying) Distributed (across CPUs) and Threads (across Cores). If you start up Julia with multiple threads (see JULIA_NUM_THREADS in the docs), you should be able to get multiple Cores running on each of your workers.
as I said, I am a beginner so I wish someone can provide a little more detail or sample codes which can really save my day.
here is my task: Suppose I need to continuously and simultaneously process some data for five different cities and data comes in real time , but not at the same time. so I designed a simple process to detect the availability of the data and process the data if available and start over again.
using Distributed
x = 5
addprocs(x+1-length(procs())) @everywhere using Pkg; @everywhere using Pkg.activate(“./julia/myenv”) @everywhere using …
@everywhere function isavailable(cityname::String)::Bool
…
end
@everywhere function func(cityname::String)
while true
if isavailable(cityname)
println(“processing started”)
processdata(cityname)
println(“processing completed”)
end
sleep(5)
end
end
for i=1:nworkers()
remotecall(func,workers()[i], cityname[i]) ;
end
as you can see, my code is just plain Julia code with no macro and other advanced features. I can run this simple code with no issue, but the issues that Julia does not use up all my cores (based on the CPU usage and measured time to complete the data processing). To be specific, if I run it in the main process with no distributed feature, processdata only takes 3-4mins, but once distributed, it take 30mins to complete even only one worker is running . so clearly each worker is limited to small set of cores (i think it is just one because task manager shows only one core being busy)
I always assumed that under multiple processor setting, Julia will optimize the CPU resources for each worker which means if I have only one worker, it should have all the resources, but once I start multiple workers, they will share the resources based on some type of load control (since I have no way to specify number of cores for each worker).
I am not sure how to use multi-threads feature in my task. based on my sample code, any suggestion will be greatly appreciated.
important: start the julia withjulia -p 22 " Starting with julia -p n provides n worker processes on the local machine. Generally it makes sense for n to equal the number of CPU threads (logical cores) on the machine. Note that the -p argument implicitly loads module Distributed ."
add " export JULIA_NUM_THREADS=22 " ( you have to find the similar windows command )
" By default, Julia starts up with a single thread of execution. This can be verified by using the command Threads.nthreads():"
please create a minimal working example code we can re-run … examine / improve …
imho: your draft example is not perfect.
please use code formatting ( “</> Preformatted text” in the menu, CTRL+SHIFT+C )
As I know the full “multi-threading” is work in progress …
“Adding parallelism to the standard library. Many common operations like sorting and array broadcasting could now use multiple threads internally.”