Questions tagged [terminology]

Usage and meaning of specific technical words/concepts in statistics.

From computing.surrey.ac.uk

Three major points:

  1. Firstly, proper terminology is concerned with the relationship between concepts, and between them and their designations, rather than with designations alone or with the objects they represent. This point is essential if quality is to be achieved, especially with synonyms and in multilingual environments.
  2. Secondly, a designation does not necessarily have to be a word or phrase, although it often is. Thus terminological resources may comprise symbols, drawings, formulae, codes, etc. as well as, or even instead of, words. This point is especially important given the move to multimedia systems.
  3. Thirdly, terminology is inextricably linked with specialist knowledge and hence with special languages or languages for special purposes (LSPs).
1530 questions
607
votes
12 answers

What is the difference between "likelihood" and "probability"?

The wikipedia page claims that likelihood and probability are distinct concepts. In non-technical parlance, "likelihood" is usually a synonym for "probability," but in statistical usage there is a clear distinction in perspective: the number that…
Douglas S. Stones
  • 6,931
  • 4
  • 16
  • 18
270
votes
6 answers

What is batch size in neural network?

I'm using Python Keras package for neural network. This is the link. Is batch_size equals to number of test samples? From Wikipedia we have this information: However, in other cases, evaluating the sum-gradient may require expensive evaluations…
user2991243
  • 3,621
  • 4
  • 22
  • 48
192
votes
14 answers

What is a data scientist?

Having recently graduated from my PhD program in statistics, I had for the last couple of months began searching for work in the field of statistics. Almost every company I considered had a job posting with a job title of "Data Scientist". In fact,…
RustyStatistician
  • 1,709
  • 3
  • 13
  • 35
150
votes
9 answers

Objective function, cost function, loss function: are they the same thing?

In machine learning, people talk about objective function, cost function, loss function. Are they just different names of the same thing? When to use them? If they are not always refer to the same thing, what are the differences?
Bin
  • 1,619
  • 3
  • 10
  • 9
131
votes
4 answers

Difference between neural net weight decay and learning rate

In the context of neural networks, what is the difference between the learning rate and weight decay?
Ryan Zotti
  • 5,927
  • 6
  • 29
  • 33
114
votes
4 answers

What does a "closed-form solution" mean?

I have come across the term "closed-form solution" quite often. What does a closed-form solution mean? How does one determine if a close-form solution exists for a given problem? Searching online, I found some information, but nothing in the context…
114
votes
16 answers

What misused statistical terms are worth correcting?

Statistics is everywhere; common usage of statistical terms is, however, often unclear. The terms probability and odds are used interchangeable in lay English despite their well-defined and different mathematical expressions. Not separating the term…
Antoni Parellada
  • 23,430
  • 15
  • 100
  • 197
107
votes
2 answers

What is covariance in plain language?

What is covariance in plain language and how is it linked to the terms dependence, correlation and variance-covariance structure with respect to repeated-measures designs?
abc
  • 1,747
  • 3
  • 17
  • 32
107
votes
12 answers

When should linear regression be called "machine learning"?

In a recent colloquium, the speaker's abstract claimed they were using machine learning. During the talk, the only thing related to machine learning was that they perform linear regression on their data. After calculating the best-fit coefficients…
102
votes
4 answers

Why isn't Logistic Regression called Logistic Classification?

Since Logistic Regression is a statistical classification model dealing with categorical dependent variables, why isn't it called Logistic Classification? Shouldn't the "Regression" name be reserved to models dealing with continuous dependent…
98
votes
30 answers

Is there a way to remember the definitions of Type I and Type II Errors?

I'm not a statistician by education, I'm a software engineer. Yet statistics comes up a lot. In fact, questions specifically about Type I and Type II error are coming up a lot in the course of my studying for the Certified Software Development…
Thomas Owens
  • 1,091
  • 1
  • 10
  • 19
95
votes
6 answers

What is the difference between Multiclass and Multilabel Problem

What is the difference between a multiclass problem and a multilabel problem?
Learner
  • 4,007
  • 11
  • 37
  • 39
90
votes
15 answers

What do you call an average that does not include outliers?

What do you call an average that does not include outliers? For example if you have a set: {90,89,92,91,5} avg = 73.4 but excluding the outlier (5) we have {90,89,92,91(,5)} avg = 90.5 How do you describe this average in statistics?
Tawani
  • 1,003
  • 1
  • 7
  • 5
85
votes
11 answers

What is the best way to remember the difference between sensitivity, specificity, precision, accuracy, and recall?

Despite having seen these terms 502847894789 times, I cannot for the life of me remember the difference between sensitivity, specificity, precision, accuracy, and recall. They're pretty simple concepts, but the names are highly unintuitive to me,…
Jessica
  • 1,781
  • 2
  • 15
  • 17
81
votes
5 answers

Explain the difference between multiple regression and multivariate regression, with minimal use of symbols/math

Are multiple and multivariate regression really different? What is a variate anyways?
1
2 3
99 100