I am trying to push Tokendocument into a corpus from the textanlysis.jl package.
Here is the code
y = Corpus() for i in data.row x = TokenDocument([m.match for m = eachmatch(r"\\[rnt](*SKIP)(*F)|\w+(?:['-,-:\/.(X X)]\w+)*",data.Omschrijving[i] )]) push!(y, x) end
the result is still an empty corpus but with X amount of documents like this:
A Corpus with 1730 documents: * 0 StringDocument's * 0 FileDocument's * 0 TokenDocument's * 0 NGramDocument's Corpus's lexicon contains 0 tokens Corpus's index contains 0 tokens
As you can see, I cannot enter all my documents into the corpus with push!() or by writing it out.
Does anyone have some ideas about resolving this?