I have a dataset where I need to find the relation between the smoking habits of a person (Yes/No) with the probability risk of a person getting a heart disease.
Please suggest how can I find the correlation between 2 level categorical feature( smoking) to continuous variable(Risk) in Python. Can Pearson correlation coefficient be used ?