I have a dataset of $N=3000$ biopsies from humans, each of which has an outcome that I am trying to model using covariates of the patient who provided the biopsy. Some biopsies from a given patient are positive whereas others from the same patient are negative, which is why I am using the biopsy (and not the patient) as my unit of analysis. A single patient contributes approximately 5-20 biopsies (rows) in my Excel sheet, which I think makes my observations non-independent, or at least highly clustered within patient. My outcome is binary (positive/negative).
Therefore, since I have 3000 biopsies from roughly 150-600 patients (3000/20 to 3000/5), and my unit of analysis is the biopsy, how can I model these data without violating the independence assumption of regression analysis?
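
To make the layout concrete, here is a minimal sketch of what the data look like: one row per biopsy, with a patient identifier repeated across rows. The values are simulated, not my real data, and the names `patient_id`, `age`, and `positive` are hypothetical stand-ins for my actual columns. The shared per-patient risk is only an assumption used to mimic the clustering I am worried about.

```python
import random

N_BIOPSIES = 3000
MIN_PER_PATIENT, MAX_PER_PATIENT = 5, 20

# Bounds on the number of patients implied by 5-20 biopsies each.
min_patients = N_BIOPSIES // MAX_PER_PATIENT  # every patient has 20 biopsies
max_patients = N_BIOPSIES // MIN_PER_PATIENT  # every patient has 5 biopsies


def simulate_rows(n_rows, seed=0):
    """Simulate one row per biopsy, clustered within hypothetical patients."""
    rng = random.Random(seed)
    rows = []
    patient_id = 0
    while len(rows) < n_rows:
        patient_id += 1
        age = rng.randint(30, 80)             # hypothetical patient-level covariate
        patient_risk = rng.uniform(0.1, 0.6)  # assumed shared risk: induces clustering
        n_biopsies = rng.randint(MIN_PER_PATIENT, MAX_PER_PATIENT)
        # Truncate the last patient so the total is exactly n_rows.
        n_biopsies = min(n_biopsies, n_rows - len(rows))
        for _ in range(n_biopsies):
            positive = 1 if rng.random() < patient_risk else 0
            rows.append({"patient_id": patient_id, "age": age, "positive": positive})
    return rows


rows = simulate_rows(N_BIOPSIES)
n_patients = len({row["patient_id"] for row in rows})
# Patients with both positive and negative biopsies (the reason for biopsy-level analysis).
outcomes_by_patient = {}
for row in rows:
    outcomes_by_patient.setdefault(row["patient_id"], set()).add(row["positive"])
n_mixed = sum(1 for outcomes in outcomes_by_patient.values() if len(outcomes) == 2)
```

Here the covariate `age` is constant within a patient while `positive` varies from biopsy to biopsy, which is exactly the structure I need the model to respect.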