I would like to use Julia for naive Bayes text classification. I have many investment fund names that i would like to regroup under the same master parent which can generally be recognized by looking at both name. I cannot do it by hand but i could inject some examples into some machine learning algorithm.
Would you have any recommendation on how to do that in Julia?
I just merged a simple implementation into TextAnalysis.jl. It’s not released (so you’ll need master of the package) and doesn’t have any documentation yet, but if you’re happy experimenting, it’s there.
Example usage is in the tests: https://github.com/JuliaText/TextAnalysis.jl/blob/master/test/bayes.jl