Questions tagged [internet]

Global system of interconnected computer networks serving billions of users worldwide.

The Internet is a global system of interconnected computer networks that use the standard Internet Protocol Suite (TCP/IP) to serve billions of users worldwide. It is a network of networks that consists of millions of private, public, academic, business, and government networks, of local to global scope, that are linked by a broad array of electronic, wireless and optical networking technologies. The Internet carries a vast range of information resources and services, such as the inter-linked hypertext documents of the World Wide Web (WWW) and the infrastructure to support electronic mail.

Reference: Stack Overflow SE

46 questions
74
votes
6 answers

Model for predicting number of YouTube views of Gangnam Style

PSY's music video "Gangnam Style" is popular: after a little more than 2 months it has about 540 million viewers. I learned this from my preteen children at dinner last week, and soon the discussion turned to whether it was possible to do…
FredrikD
  • 843
  • 7
  • 15
18
votes
2 answers

What distribution is most commonly used to model server response time?

I have a servlet-based application wherein I measure the time taken to complete each request to that servlet. I already compute simple statistics like the mean and maximum; I'd like to produce some more sophisticated analysis however, and to do so I…
Sean Owen
  • 311
  • 4
  • 10
14
votes
1 answer

Do unique visitors to a website follow a power law?

Suppose I have an ordered vector where the first element is the number of visits to a website in a given period of time by the unique IP with the highest number of visits, the second element is the number of visits by the unique IP with the second…
Jon Smark
  • 225
  • 2
  • 9
11
votes
6 answers

Is there a site to post my survey to so I can get a sample representative of the population?

This is only for my high school senior project, so it doesn't have to be perfect. I'm doing a project on global warming, and I want to survey people for their opinions. I know that if I use a convenience sample of my classmates, I'll have a lot of…
George
  • 113
  • 1
  • 1
  • 4
8
votes
2 answers

Math behind multivariate testing for website optimization

I am looking for theoretical resources (books, tutorials, etc.) to learn about making sound statistical inferences given (plenty of) multivariate website conversion data. I'm after the math involved, and cannot find any good non-marketing stuff on…
8
votes
2 answers

Why is a lognormal distribution a good fit for server response times?

I found out that webserver response times are typically modeled as coming from a lognormal distribution here. What I don't quite get is why this is the case! In particular, Wikipedia states that a random variable X is distributed lognormally when it…
maliky0_o
  • 223
  • 1
  • 5
8
votes
4 answers

Internet statistics resources suitable for psychology students doing research

I am working on a new webpage for my part-time job as a methodological/statistical consultant for (psychology) students at my university. On this website I would like to place several links to online resources for clients to consult themselves. So I…
Sacha Epskamp
  • 287
  • 1
  • 13
7
votes
2 answers

What could be a statistical test for comparing funnel data before and after important changes?

I have some funnel data from a website I manage. After some time I made some improvements to a multi-step purchase process. I wonder if it's possible to formulate a test that can assess, with statistical significance, whether the improvements in the process…
5
votes
2 answers

Estimating demographics from internet activity

I'm looking for research papers on estimating demographic information (age, wealth, location, gender, etc.) from internet activity (preferably social network activity or browser history). Where would be a good place to look?
Enno Shioji
  • 459
  • 3
  • 10
5
votes
1 answer

Can you recommend an online survey platform for 5k+ participants?

I am planning to perform an online behavioral survey across a nationwide sample, and I expect several thousand responses. I do not expect to have many questions (perhaps 3 pages, 8 questions each), I require skip logic, and I would like it to redirect back…
patrickmdnet
  • 359
  • 1
  • 2
  • 10
4
votes
1 answer

Identifying a minimum time on web page as proxy for successful search outcome

Given a correlation between two sequences of numbers, I need to find at what point in one sequence the correlation is strongest. Here's why: I'm tracking large numbers of pages in a knowledge base, and flagging certain pages for followup. For…
Joe Pairman
  • 205
  • 1
  • 4
4
votes
2 answers

Testing significance of a data point against a data set

Background: I am currently testing conversions on a web page and would like to test if the changes I made yesterday produced a significant change in conversions. The Data Set: For each day I test, I have the total number of visitors to the site and…
Hzmy
  • 141
  • 1
4
votes
4 answers

High-Frequency Time-Series Forecast With A Lower Bound

I am helping a friend with a data project. He's interested in building a canary-in-the-coal-mine alert system for his website which tells him when the number of users dips below some critical lower bound. The number of users varies by time of day,…
dimitriy
  • 31,081
  • 5
  • 63
  • 138
4
votes
1 answer

Is machine learning viable for extracting product information from webpages?

I have a task to extract product information from a certain set of websites for price analysis. The product group I'm trying to harvest data for is well defined; I could easily provide a set with all the product names and brands I want to obtain price…
4
votes
1 answer

Controlling for biased audiences in online surveys

For a long time I have been meaning to set up a bunch of online surveys asking a whole range of social questions and publish the results. I am well aware that there are various difficulties to overcome in this goal (e.g. privacy, excluding bots…