It sounds like you're asking about visualizing the relationship between the class and each individual input feature. If that's the case, you can plot the conditional distribution of each feature, given that the class takes each particular value. Say you have features $\{x_1, \dots, x_p\}, x_i \in \mathbb{R}$ and class $y \in \{1, \dots, k\}$. For a given feature $x_i$, estimate $p(x_i \mid y=j)$ for each value of $j$. To do this, you can select the data points with the corresponding class and estimate the distribution of the desired feature using a kernel density estimator or histogram. Plot the density estimates together on the same axis, each colored according to the class. The left plot below shows an example for a binary classification problem (but this works for multiclass problems too).
You can also plot the probability that the class takes a particular value, given the value of the feature, that is $p(y=j \mid x_i)$. The density estimates already give $p(x_i \mid y)$, so use Bayes' rule:
$$p(y=j \mid x_i) = \frac{p(x_i \mid y=j) p(y=j)}{\sum_{l=1}^k p(x_i \mid y=l) p(y=l)}$$
You can estimate $p(y)$ using the frequencies of classes in the dataset. The right plot below shows an example (for the same dataset as the left plot). Only one curve is necessary for binary classification problems, because $p(y=0 \mid x_i) = 1 - p(y=1 \mid x_i)$. Multiple curves can be plotted for multiclass problems. Care must be taken when interpreting this plot because it may not be reliable in low density regions (where $p(x_i \mid y)$ is low for all classes); in these cases it can amplify noise in the density estimates. You could imagine plotting bootstrap confidence intervals.

Of course, this only gives a view of the marginal distributions, and the usual caveats apply. A feature on its own might contain no information about the class, but be highly informative in combination with other features. You could use the same type of plot to examine pairs of features, using bivariate density estimates.