I find I best learn by example but I can't seem to find any that match with this, or at most, people appear bizarrely unwilling to show where in these abstract equations you actually insert which numbers to do an actual calculation. Or make some assumptions like the domain being Gaussian that doesn't seem to apply to what I'm doing.
So I have a table of binary training data, that is, 3 input columns which I'll label x1
, x2
, and x3
, and an output column which I'll call C
. All of these simply contain yes/no (or 1/0 if you prefer) answers. Feel free to redefine these names if they collide with your preferred notation.
Here is an example table of data, I suspect this will not format nicely on mobile devices.
+-----+-----+-----+-----+
| x1 | x2 | x3 | C |
+-----+-----+-----+-----+
| yes | yes | no | no |
| no | yes | no | yes |
| yes | yes | no | no |
| no | yes | no | no |
| yes | no | yes | yes |
| no | yes | no | yes |
| no | yes | no | no |
| yes | no | yes | yes |
| yes | yes | no | yes |
| no | yes | no | no |
| yes | no | no | yes |
+-----+-----+-----+-----+
$P(C=y) = 6/11$, $P(C=n) = 5/11$
What should I do with this to produce some sort of MLE? What does that tell me about my data? Am I missing the point?
I have seen this CV.SE question, and I feel like it's not a big step to apply its accepted answer to my data but I just can't connect it with this data set, I need to see an example of it being used.
Thank you very much. :) If there's any further info you need do comment and ask, I'm new to both CV and fairly new to this sort of thing in general