Questions tagged [chemistry]
27 questions
29
votes
1 answer
When if ever is a median statistic a sufficient statistic?
I came across a casual remark on The Chemical Statistician that a sample median could often be a choice for a sufficient statistic but, besides the obvious case of one or two observations where it equals the sample mean, I cannot think of another…

Xi'an
- 90,397
- 9
- 157
- 575
6
votes
1 answer
How to predict a binary outcome with unbalanced repeated measures data?
I’m somewhat lost as to how to approach a problem I have and was hoping someone can suggest the most appropriate method.
The problem:
I have a data set consisting of concentration values for 8 molecules. These are measured on approx. 230 samples…

ric
- 111
- 1
- 1
- 4
3
votes
0 answers
How do I record randomly assigned data succinctly in a table
Please forgive me if I am posting this in the wrong forum. Here is the context of my question:
I am conducting a drug trial, of sorts.
I have 7-8 potential candidates for the treatment.
Between each drug administration, there has to be a…

Rushi
- 141
- 4
3
votes
0 answers
How to estimate the uncertainty of a function that has to be solved numerically
Consider I have a function:
$PV_m^3 - PbV_m^2 +aV_m+ab=RT$
In this example I have measured the uncertainties in $P$ and $T$ experimentally and the errors in $a$ and $b$ can be assumed to be zero. How would I go about estimating the error in…

tionichm
- 41
- 5
3
votes
0 answers
can the parameters of this nonlinear equation be reliably estimated using a limited set of experimental data?
A chemical system at equilibrium is described by this equation:
$$
D_{tot} = D + D \cdot \frac {P \cdot X}{D+K_P} + D \cdot \frac {E}{D+K_E}
$$
The only parameters that the experimenter can control (by setting up the experiment with different…

user6376297
- 539
- 2
- 10
3
votes
2 answers
Intersections of chemistry and statistics
I am asking this question for a friend who knows a lot of chemistry and is now studying statistics, primarily since he heard this is the age of data and one should know statistics. However, he is interested to know if there are works on the…

Landon Carter
- 1,295
- 11
- 21
3
votes
1 answer
Dimensions of clustering results
My PhD research (computational organic chemistry) often generates large data sets (>10000 entires) of 'conformers' - where a conformer is basically the spatial arrangement of atoms in space.
From 10000 conformers, there are generally around 10…

James
- 31
- 1
2
votes
1 answer
Compare two samples?
Similar questions have been asked but have not managed to get a conclusion from them.
I am comparing two sets of samples, where ratios have been obtained for several analytes per sample. So the values are restricted to be in the [0,1] interval. One…

David
- 21
- 2
2
votes
1 answer
Determining whether new data is "in distribution" with training data
I'm hoping to use machine learning to predict chemical properties of various molecules. Many chemistry machine learning research papers that I come across talk about model generalizability issues related to new molecules that are not…

Ryan Zotti
- 5,927
- 6
- 29
- 33
2
votes
1 answer
Function to fit acid reaction curve
Data in the image below shows a variable DME reacting to the increase in acid concentration. It looks like it has a very fast increase from when the concentration is raised from 0 to 0.5 and then a rapid diminishing return.
I thought that a base 10…

Deathkill14
- 2,140
- 10
- 17
2
votes
0 answers
Adjacency matrix of amino-acid residue
How to get the mathematical representation of the chemical bonds in protein residue or the whole protein? Is it possible to get the connectivity matrix or a pairwise description?
The example in the photo: PyRosetta Manual, page 16. The full atom…

Pol Bel
- 21
- 2
2
votes
0 answers
Correlation between chemical analysis results
I have performed chemical analysis on a sample and the resulting data is something like this: "Carbon: 1.2%, Iron: 50%, Vanadium 2.3% ..." and so on.
I have 25 sets of similar data taken from other samples and I need to figure out which of these 25…

WarraChe
- 21
- 2
2
votes
0 answers
Is it possible to use PCA twice, first on several subsets of data, and then again on the main components of those subsets?
I am interested in understanding if it is possible to use PCA twice, first on several subsets of data, and then again on the main components of those data subsets. I'm not entirely sure if this will give me the answer I am looking for.
For example:…

Syd26S
- 21
- 2
2
votes
1 answer
Choice of statistical test - confirmation
I want to know if I'm choosing the right statistical test for my situation.
Description:
In a production process there are several places where wastewater is created (lets call these "spots", spot 1,2 and 3). At every spot 1 sample of water is taken…

Olivier_s_j
- 1,055
- 2
- 11
- 25
1
vote
1 answer
Graph Classification via Random Forest
This is my first post here, just a brief presentation: my name is Gianmarco, I’m Medicinal Chemistry undergraduate student who is preparing his dissertation, my idea would be to create a classifier that can distinguish anticancer drugs as active or…

Gianmarco Luchetti
- 111
- 1