Questions tagged [distributed-computing]
6 questions
3
votes
1 answer
Distributed PCA or an equivalent
We normally have fairly large datasets to model on, just to give you an idea:
over 1M features (sparse, average population of features is around 12%);
over 60M rows.
A lot of modeling algorithms and tools don't scale to such wide datasets.
So…

Tagar
- 143
- 7
2
votes
0 answers
Distributed Statistical Inference
I am interested in statistical inference using distributed methods, which basically means $m$ machines receiving a subset of the data $X_m$. We have some unknown distribution $\mu_\theta$ in a parametric family $P$, and we are trying to estimate…

twnly
- 121
- 3
1
vote
0 answers
Fake distributed computation - secure summation on IRLS for binary logistic regression
I am attempting to perform an IRLS algorithm to estimate regression parameters for a logistic regression model.
This is the algorithm that I am following
Select initial values for the regression parameters $\boldsymbol{\beta}^{\text {old…

Susan-l3p
- 83
- 5
1
vote
0 answers
What data is sent from a client to a server in a federated learning setting?
So far, I thought federated learning works like this:
All clients have the same machine learning model (if not personalized). They have their unique data and then train this model (e.g., neural network) with their data. Then, they have a new matrix…

FedL
- 11
- 1
1
vote
0 answers
Logistic regression on HDFS, what's the algorithm?
How does Spark (or something similar) estimate a logistic regression model, or any statistical model that is estimated by an optimization algorithm, when the data are stored in a distributed environment, such as HDFS?
I read/heard that each…

Glen
- 6,320
- 4
- 37
- 59
0
votes
0 answers
fast distance metric between a new data entry and available clusters of data
Assume we divide a large data set D into m different partitions of data in a distributed learning case. We do the training in the clusters and they are local experts. Now, we have some new test points that we want to assign to the partitions and…

Ham82
- 113
- 5