I have a dataframe with 20 categorical variables, each with 30+ levels. As a result I don't have a target variable on hand per-say but I would like to use statistical techniques or machine learning to show specifically how the certain levels of each variable relate to eachother.
When we see "B" in Column D
is it expected to also see 'G' in Column J
?
I was thinking maybe finding counts of the variables but are there anyways in going past the Fisher Exact and Chi Squared tests? Maybe see the interplay between the distribution of frequencies on more than one variable??
My major point here is I would like to use machine learning to determine which levels are coming up more often than others, but without a target variable I am unsure how to proceed with feature selection. Seems unsupervised but I am unsure how I could go about pointing fingers at a specific level of a variable or show how they relate to one another?