I'm looking to test some Morphological Analysis tools, which split sentences into tokens and provide part of speech and pronunciation information (specifically, providing kana when given kanji).
I'm after some sentences which automated tools would have trouble with. Examples of what I'm after are sentences which are difficult to determine the word boundaries, pronunciation or part of speech. Preferably, examples would be a more correct interpretation, and be something that could be seen in actual use.
The best I have found so far is "私はあくまで執事ですから". In a similar vain, is "にわにはにわとりがいる (庭には二羽鶏が居る)" which only applies to spoken use. An english example might be something like "Buffalo buffalo Buffalo buffalo buffalo buffalo Buffalo buffalo", which is difficult to determine the part of speech each word is.