I am practicing using the Naive Bayes classifier to predict whether people get a stroke or not, but, I am confused with two classifiers. One is categorical Naive Bayes, another is Gaussian Naive Bayes.
For example, in the dataset, there are several text attributes such as gender, ever_married, and ever_smoked. Some of the columns are numerical data. For standardization, I use dummies like sex = pd.get_dummies(df['gender'],drop_first=True)
to transform the text to binary, and then standardized the dataset and use the Gaussian Naive Bayes classifier to train the data. Is this the correct way to do it?
Or should I directly use the Categorical Naive Bayes to train the data? However, some columns are numerical that is not reasonable to use this classifier?
Any help is highly appreciated.