Wikipedia - KL properties says that KL can never be negative. But e.g. for texts where the probabilities are very small I somehow get negative values? E.g.
Collection A: - word count: 321 doc count: 65888 probA: 0,004871904
Collection B: - word count: 1244 doc count: 120344 probB: =0,010337034
KL = $0.004871904 \cdot \ln\frac{0.004871904}{0.010337034} = -0.003664881$