The words that extract_words() finds can be negatively affected by other words that are seemingly far away and unrelated. This is due to the two-step process the method employs: grouping lines ...
This will be a pain in the ass considering the span soup created by GDocs, but it ultimately needs to be done for this to be anywhere near feature-complete.