Match cheaters
catch_em(flist, n_grams = 10, time_lim = 1L, progress_bar = TRUE)
Argument | Description |
---|---|
flist | a list of documents ( |
n_grams | see |
time_lim | Maximum time in seconds allowed for each comparison. The default is 1 second, which has been enough to compare documents of 50K words without problems. |
progress_bar | Should a progress bar be printed to the console? |
A correlation matrix of class chtrs, with each cell indicating the match (0-1) between two of the documents.
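
To make the idea of a pairwise 0-1 match matrix concrete, here is a minimal base R sketch. It does not reproduce what catch_em does internally: the helpers `word_ngrams` and `ngram_match` are hypothetical names, and the score (Jaccard overlap of word n-gram sets) is an assumption chosen only for illustration. It uses n = 3 rather than the default n_grams = 10 because the sample texts are very short.

```r
# Hypothetical helper: split a text into overlapping word n-grams.
word_ngrams <- function(text, n) {
  words <- strsplit(tolower(text), "[^[:alnum:]]+")[[1]]
  words <- words[nzchar(words)]            # drop empty tokens
  if (length(words) < n) return(character(0))
  starts <- seq_len(length(words) - n + 1)
  vapply(starts,
         function(i) paste(words[i:(i + n - 1)], collapse = " "),
         character(1))
}

# Hypothetical score (an assumption, not the package's method):
# Jaccard overlap of the two n-gram sets, which lies between 0 and 1.
ngram_match <- function(a, b, n = 3) {
  ga <- unique(word_ngrams(a, n))
  gb <- unique(word_ngrams(b, n))
  if (length(ga) == 0 || length(gb) == 0) return(0)
  length(intersect(ga, gb)) / length(union(ga, gb))
}

# Toy in-memory documents; real input would come from files.
docs <- c(
  doc1 = "the quick brown fox jumps over the lazy dog",
  doc2 = "the quick brown fox sleeps under the lazy dog",
  doc3 = "an entirely different sentence with no shared phrases"
)

# Pairwise score matrix, similar in shape to the value described above.
idx <- seq_along(docs)
scores <- outer(idx, idx,
                Vectorize(function(i, j) ngram_match(docs[[i]], docs[[j]], n = 3)))
dimnames(scores) <- list(names(docs), names(docs))
print(round(scores, 2))
```

The resulting matrix is symmetric with ones on the diagonal, since every document matches itself completely. Larger values of n make the score stricter, because longer runs of identical words are needed before two documents share an n-gram.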