Sorry to those of you who get upset with me. I will admit I haven’t spent the time to really try to learn Julia hardcore, mostly in free time off and on. But I have tried to read the docs in depth and they are hard to understand most of the time. I finally found some stuff to help me even read function definitions. I hope someone eventually writes a full manual going from R to Julia for data scientists. Maybe this exists and I don’t know. I can only use Julia right now for matrix algebra. The other stuff I still find a bit hard to understand and learn. I’ll wait until the documentation gets a little better to come back to Julia, but for now I’ve mostly left. Eventually I’m sure it will get easier to use with better packages.
We (as data scientists) also need to wait until there are more statistics packages in Julia. Right now it seems a bit light and not that great. I think Bates has moved over to Julia though, which will help with mixed models.
One thing I find hard is recycling things. Like creating a table and then pulling out the second value in the table in Julia. Maybe this is easy and I haven’t looked into it.
@ScottPJones Yes, I think it might take a year or so to get it stable enough with enough documentation. However, some of us need it to finish a PhD in 6 months or so :). I agree, Julia has improved leaps and bounds since I just started a bit a view years ago. At that time still very unstable, but it has gotten a lot better. We (in biological sciences) appreciate all the work from developers. My point in commenting on discourse is to make sure computer scientists don’t write a language for computer scientists. If it’s going to be a scientific computing language at all, it should be geared towards as easy as possible. We just don’t have time to understand all the nuances of the language. But base matrix algebra I find quite easy. Many other things as well. It’s the things about it being a ‘static’ (ish) language that I don’t understand. One thing is the lack of simple as._____ functions like in R. I think this is because of the static ish nature of julia I can’t just convert anything to anything easily. There are different syntaxes and functions I don’t understand. One guy actually emailed them when I asked, but didn’t find it on the docs yet. Hopefully these things will be explained better to simple R users like myself. I guess I’m trying to be constructive, but it doesn’t come off that way over this discourse. I just want to give my perspective from an R user. The people that came from C++ I know, find it to be very “easy”, where the R/bash user finds it very difficult. All I’m saying is to not forget about the R users if we want Julia to excel in Data Science (back to the original post). If it’s easy for C++ you can grab them, but the whole goal of Julia I thought was to be an in-between so R users would be a huge market to grab from. Right now it still seems too difficult for regular R users.
@felix I would do a side-by-side but I can’t figure out the dataframes packages. There are very simple examples I’m finding online, but nothing like the complexity I have with my real data. They are too simple for me to go from that to complex problems. One thing I’ve really been frustrated by is the dataframes package in Julia. Not really because it’s bad or poorly designed, it’s just because it lacks documentation in my opinion. And this is where most open source stuff falls on it’s face. The rest of us will need a Hadley Wickham to come along for Julia to explain what’s going on. I really struggle to even understand the base docs for Julia. Again, because we come from biology, not from programming and computer science. I’m sure you guys think we are probably stupid, but we only have so much time to devote to programming and learning another language. I’ve finally started to find some really good stuff (one presentation that made me understand some very basic things about julia not before discussed anywhere else). I should start making a list of things I find difficult. Many that commented are asking me for concrete things. I think it will take someone eventually writing an R to Julia book.
@Yifan_Liu Yes, cant’ beat dplyr although some on here think otherwise. I don’t know any MATLAB, maybe that’s why I struggle so much in Julia. I just take it one day at a time. But it takes me whole day sometimes to figure out the simplest little thing about Julia. So I give up a lot and start again next week.
@Tamas_Papp But the people who write code don’t always see why it’s unfriendly to the end user (such as R users). For instance, I think some people love data.table in R. But I’ve tried to learn a few times before and give up. It’s just complex and not straightforward. It is I’m sure for those that wrote it and thought about it. It is cheap to produce, but if people are complaining does that not suggest that maybe the people programming it could do something to either make it easier for the end user or write enough documentation so we understand. You do need contributors, but all of you developers will have wasted your time if it’s a let down for those who try it. We have at least 4 people here in my department that tried it and stopped just because it’s that much more difficult to master than R. Maybe that will change and we can get them back after a stable version and more documentation. Hard to write I know when that language changes so rapidly.
@DoktorMike I guess we’ll have to agree to disagree… I tell everyone that asks me to hold off another year or two (maybe more). Otherwise they will just get frustrated and leave. I’m not sure I’ll ever stop using R (we’ll see), but Julia can replace the low level languages we need to write fast enough software for genetic/bioinformatic research. Still not sure if it will replace C/Java/Fortran for the production code/software we have in quantitative genetics.
@pfitzseb Sorry, I’m not sure what to say. I haven’t used it in a while. Maybe Atom has gotten better. I just don’t like the way it looks or anything. I just find TexStudio/RStudio way better designed and easy to use. I’ve asked people in my department and they agree with me 100%. No one here that has tried Julia likes Atom. They have only had issues as well. Maybe this will get better. But I would suggest starting from scratch and trying to mimic RStudio exactly. I hate jupyter too. Not sure how people can stay in that environment all day. Good for teaching I guess, but not what I want for my coding day to day. I’ll try VSCode. I’m sure it will get better and better. I comment on here only to draw developer attention to those of us struggling to learn the “super easy” language of Julia. I’m finding some parts very easy (those you find online). When I want to do something very complex, like write genetics software, I fall on my face and can’t figure out how to do much. So I think it’s oversold as easy and when you get there it’s disappointing. I found python many folds easier to learn, but gave up because the package manager was a joke (didn’t seem to have one that worked from what I could tell). I really hope there are no more issues with packages in Julia or it will also fail. I undervalued the CRAN when I started in Julia and python. It’s maybe one of the best things about R. Have only had 1 or 2 issues ever with downloading packages in R. This is under-valued I think by developers who come from C/C++ maybe…
@pdeffebach Maybe I have just gotten used to the weirdness of R. I guess I’ve struggled to complete complex things with dataframes in Julia. There are simple examples on line I follow and then cannot figure out more complex ones, such as summing up the number of missing values for each animal and calculating a percentage based on the number of observations for each unit (animals for me). This is quite easy with dplyr in R. Can’t find the docs to do this quite yet in Julia, although admittedly I haven’t looked super hard. But I have looked into the docs for each of those packages and can’t figure out how to do much more complex things than they have listed. I’d just like to see many more types of data wrangling on the website.