Questions tagged [structured-prediction]

13 questions
7
votes
1 answer

Deep Learning vs Structured Learning

I am interested in the differences between using large, deep learning networks vs Probabilistic graphical models (PGMs), like Random Field models, for structured learning (e.g. on images, or labels of arbitrary graphs on surfaces, etc...). For…
6
votes
1 answer

predicting tree structure

This topic is actually rather hard to google for as 'tree' has been overloaded in this domain to refer to decision trees. I'd be interested in having a learning algorithm produce code, such as used in Microsoft Power BI's feature to query databases…
3
votes
1 answer

Any reasons to prefer neural networks over boosting methods in tabular data?

Based on Kaggle winners data, it seems that ensemble boosting methods like XGBOOST, LIGHTGBM, CATBOOST are the top choices when dealing with structured or tabular data for maximizing the prediction accuracy. However, in industry as far as I know,…
2
votes
0 answers

Any research on learning Bayesian network structure with a limit on the parent set size?

Learning a maximum-scored Bayesian network structure with bounded treewidth is rather popular in recent years, as stated in the paper A survey on Bayesian network structure learning from data in 2019. However, I failed to find out any recent…
2
votes
1 answer

Binary linear SVM in terms of structured SVM

On slide 32/85 of this tutorial on structured SVM learning, the author formulates binary SVM classification in terms of structured output SVM by defining $\Psi(x, y) = \frac{y}{2}x$. Why is there a constant factor of $1/2$ in the feature map and…
1
vote
0 answers

How to model a problem where the aim is to predict if atleast one of 3 inputs is "different" from the rest?

I have a tabular dataset with 10000 rows, each with a RowID and 30 numerical features. There are multiple rows with the same RowID. The aim is to come up with a model, which at test time would take 3 rows (without the RowID) and predict if at least…
1
vote
0 answers

About Feature Engineering Tips When “discriminative algorithm care about no modeling the probability of the language”

I was going over my old NLP course slides and one of the pages is about using Structured Perceptron for tagging. It claims that because the algorithm is discriminative, it doesn't care about modeling the probability of the language, and thus "every…
1
vote
0 answers

Regression - many samples have the same target

I have a machine learning problem in which I have a many-to-one relationship from samples to targets. I have ~3k samples but only 11 targets with a shared key YEAR My first approach was to reshape the targets to get a one-to-one relationship, but…
TobSta
  • 111
  • 1
1
vote
0 answers

How to encode dependence structure in regression (Bayesian networks)

Suppose you know the model should be of the form y =f(x1)g(x2,x3), where f and g are the functions I'm trying to find. Essentially, x2, x3 collectively predict some hidden variable z, and the response variable y is z dampened by some function of x1.…
1
vote
0 answers

Options for spatially correlated outputs (i.e. structured regression)?

I have a convolutional neural network which takes a 40x40 real valued input, and maps it to a real valued 40x40 output. I've optimized the number of convolution layers, filter size, hidden layer size, activations and optimiser. My current setup…
1
vote
0 answers

Which model for this information extraction problem?

I am trying to solve the following pattern recognition / information extraction problem. Assume I have a text where each token has been annotated by a single class among $K$ classes available (with a conditional random field for example). Results…
1
vote
0 answers

Linear CRF is to LR what _____ is to RF/SVM

This figure from (1) shows the relation between LR and linear CRF: linear CRF is pretty much the generalization of logistic regression to sequences: Is there any established generalization of random forest or SVM to sequences? (1) Sutton, Charles,…
Franck Dernoncourt
  • 42,093
  • 30
  • 155
  • 271
0
votes
1 answer

How to take advantage of the information/structure we have in the labels in multiple output regression?

I have a regression problem where each observation possesses a vector of features and 4 associated responses. These responses, as in many problems are correlated. It would be nice to be able to predict them "all at once" in order to take into…
Tom
  • 1,204
  • 8
  • 17