This may be a lower level question, so thank you in advance to any patient person who has an answer for me. I develop non-linear mixed effects models of pharmaceutical kinetics. One tool I'm frequently using is the observed Fisher information matrix. As I understand, the standard definition is,
$$ \mathbf{I}(\theta)=-\frac{\partial^{2}}{\partial\theta_{i}\partial\theta_{j}}l(\theta),~~~~ 1\leq i, j\leq p $$
What I use it most frequently for is deriving the standard error and variance of parameter estimates via,
$$ \mathrm{Var}(\hat{\theta}_{\mathrm{ML}})=[\mathbf{I}(\hat{\theta}_{\mathrm{ML}})]^{-1} $$
However, I am unable to find a derivation which explains why this works :/ Can anyone explain to me why the inverse of the Fisher information matrix is an estimate of the variance-covariance matrix for the population parameters?
Thanks so much m-_-m
EDIT: I found the answer I wanted using the following document
- Name: A Tutorial on Fisher Information
- Authors: Alexander Ly, Maarten Marsman, et Al
- https://arxiv.org/pdf/1705.01064.pdf