What are the best practices for applying NER to long texts (e.g., 20+ pages)?
A common piece of advice is to split the text before passing it to the model. However, this can require significant manual work to establish splitting rules, especially if the documents follow many different templates.
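To make the splitting advice concrete, here is a minimal sketch of a template-agnostic variant: fixed-size word windows with overlap, each returned with its character offset so that entity spans predicted on a chunk can be mapped back to the full document. The function name `sliding_windows` and the parameters `max_words` and `overlap` are illustrative assumptions, not part of any NER library, and the default sizes are arbitrary.

```python
import re
from typing import Iterator, Tuple


def sliding_windows(
    text: str, max_words: int = 200, overlap: int = 50
) -> Iterator[Tuple[int, str]]:
    """Yield (char_offset, chunk) pairs covering `text` with overlapping word windows.

    Hypothetical helper: sizes are counted in whitespace-delimited words,
    which only approximates the model's real token limit.
    """
    if not 0 <= overlap < max_words:
        raise ValueError("overlap must satisfy 0 <= overlap < max_words")

    # Character spans of every whitespace-delimited word in the document.
    spans = [m.span() for m in re.finditer(r"\S+", text)]
    step = max_words - overlap

    for i in range(0, len(spans), step):
        window = spans[i:i + max_words]
        start, end = window[0][0], window[-1][1]
        # The chunk is an exact slice of the original text, so
        # chunk-relative entity offsets + `start` give document offsets.
        yield start, text[start:end]
        if i + max_words >= len(spans):
            break  # this window already reached the end of the text


if __name__ == "__main__":
    doc = " ".join(f"w{i}" for i in range(10))
    for offset, chunk in sliding_windows(doc, max_words=4, overlap=1):
        print(offset, repr(chunk))
```

With overlap, an entity cut off at the edge of one window should appear whole in the neighbouring window, but entities found twice in the overlapping region still need to be deduplicated after inference, for example by comparing their document-level offsets.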
What alternatives or complementary practices are there when building an NER solution for large documents?
Are there common tricks for making the data-splitting logic generalizable?