Questions tagged [fraud-detection]

Fraud detection using statistical and machine learning methods

Fraud detection is a set of activities undertaken to detect money or property being obtained through false pretenses. It is common in banking (e.g. use of stolen credit cards or money laundering) or insurance (exaggerating losses or causing an accident with the sole intent for the payout). Fraud detection can be done by classification and/or outlier detection algorithms of machine learning .

47 questions
24
votes
2 answers

Statistical forensics: Benford and beyond

What broad methods are there to detect fraud, anomalies, fudging, etc. in scientific works produced by a third party? (I was motivated to ask this by the recent Marc Hauser affair.) Usually for election and accounting fraud, some variant of…
shabbychef
  • 10,388
  • 7
  • 50
  • 93
12
votes
2 answers

Can p-values for Pearson's correlation test be computed just from correlation coefficient and sample size?

Background: I read one article where authors report Pearson correlation 0.754 from sample size 878. Resulting p-value for correlation test is "two star" significant (i.e. p < 0.01). However, I think that with such a large sample size, corresponding…
sitems
  • 3,649
  • 1
  • 25
  • 52
9
votes
2 answers

How is election fraud by ballot stuffing possible?

An NPR story on the upcoming Russian presidential election mentioned that 5% of polling sites would be equipped with new electronic ballot boxes that would reject attempts to submit multiple ballots. There was "concern" that this would not do…
Ben Jackson
  • 251
  • 1
  • 3
  • 9
8
votes
1 answer

Would you flag this data as fraudulent?

Let's suppose you have been given some data from a randomized block design with 4 repetitions and 23 treatments. After an initial inspection of the data, you notice that for 8 treatments all repetitions are identical, which is obviously wrong. After…
Teo
  • 81
  • 2
6
votes
1 answer

Any good reference books/material to help me build a txn level fraud detection model?

I am looking for a book/case study etc on how to build a fraud detection model at the transaction level. Something applied rather than theoretical would be really helpful.
6
votes
1 answer

Identifying fraudulent questionnaires

Questionaires are often used in social sciences. Many people try to complete them very quickly and very often they only "guess" answers. Is there any statistical technique or any research in this area, how to identify which questionnaires are…
sitems
  • 3,649
  • 1
  • 25
  • 52
5
votes
1 answer

data mining methods/algorithms for fraud case

I recently got into a topic regarding fraudulent transactions. I am relatively new to data mining and just looking for some input for my case here. I started with a cluster analysis / anomaly detection and got some good results in the first step.…
Anghostdy
  • 93
  • 5
4
votes
2 answers

Medical Insurance Fraud Detection: Text analysis

I'm trying to analyse a dataset to detect fraudulent insurance claims. Unfortunately, other than basic demographics the rest of the claim is a free format OCR scanned text file made from documents submitted by the patient. These documents include…
curious_cat
  • 1,043
  • 10
  • 28
4
votes
2 answers

Feature engineering for fraud detection

I'm doing some research into fraud detection for academic purposes. I' d like to know specifically about techniques for feature selection\engeneering from a transactional dataset. In more details, given a dataset of transactions (credit card for…
4
votes
0 answers

How to retrain a production classifier that blocks its own positive examples?

I'm looking for help understanding how to re-train a fraud detection classifier that's been deployed to production (where it successfully blocked much, but not all fraud coming into the system). I haven't seen this subtlety addressed in papers on…
4
votes
2 answers

Election forensics using statistical methods in practice?

Are you aware of any examples of election forensics in practice? Or at least any applied research on real large-scale datasets (e.g. govermental elections)? Thanks.
Alexey Kalmykov
  • 228
  • 1
  • 6
3
votes
2 answers

Machine Learning problem - identifying fake fraudulent names

I have a dataset of fraudulent orders from some business. Each order has a bunch of features such as order_amount, address, state, city, phone_number, and name. Obviously a criminal would not be using his/her real name when making a fraudulent…
user1893354
  • 1,435
  • 4
  • 15
  • 25
3
votes
2 answers

How to apply clustering analysis to help identify criminal entities out from credit card usage data?

Let's say I have a ton of credit card usage data and have also some means to predict if a given transaction is fraudulent. Now I want to know what kind of criminal entities are behind these frauds. Maybe some are big international/domestic crime…
Enno Shioji
  • 459
  • 3
  • 10
3
votes
2 answers

Strange election results and probability of election fraud

Suppose an election is held for the leadership position in a major political party. Four candidates are running. After the election, the following results are announced: Candidate A: 160823 votes Candidate B: 115162 votes Candidate C:…
nikosd
  • 409
  • 3
  • 8
3
votes
0 answers

Reading recommendation on using statistical analysis in online fraud prevention

Can you please recommend good reads on statistical analysis related to online fraud detection and prevention of account abuse? Thank you.
notrockstar
  • 203
  • 2
  • 4
1
2 3 4