Questions tagged [software]

Make sure your question is related to STATISTICS, MACHINE LEARNING, or DATA ANALYSIS and is not solely about programming, support, or bugs. Include additional tags referring to the statistical issues involved.

Computer software, or just software, is any set of machine-readable instructions (most often in the form of a computer program) that directs a computer's processor to perform specific operations. The term is used to contrast with computer hardware, the physical objects (processor and related devices) that carry out the instructions. Hardware and software require each other; neither has any value without the other.

Reference: Wikipedia

232 questions
167
votes
21 answers

Does Julia have any hope of sticking in the statistical community?

I recently read a post from R-Bloggers, that linked to this blog post from John Myles White about a new language called Julia. Julia takes advantage of a just-in-time compiler that gives it wicked fast run times and puts it on the same order of…
Christopher Aden
  • 1,775
  • 4
  • 24
  • 43
71
votes
19 answers

What are some valuable Statistical Analysis open source projects?

What are some valuable Statistical Analysis open source projects available right now? Edit: as pointed out by Sharpie, valuable could mean helping you get things done faster or more cheaply.
grokus
  • 233
  • 5
  • 10
66
votes
8 answers

Is the R language reliable for the field of economics?

I am a graduate student in economics who recently converted to R from other very well-known statistical packages (I was using SPSS mainly). My little problem at the moment is that I am the only R user in my class. My classmates use Stata and Gauss…
SavedByJESUS
  • 1,123
  • 3
  • 10
  • 14
62
votes
12 answers

Software needed to scrape data from graph

Anybody have any experience with software (preferably free, preferably open source) that will take an image of data plotted on cartesian coordinates (a standard, everyday plot) and extract the coordinates of the points plotted on the…
Alex Holcombe
  • 519
  • 1
  • 7
  • 9
56
votes
9 answers

How do R and Python complement each other in data science?

In many tutorials or manuals the narrative seems to imply that R and python coexist as complementary components of the analysis process. To my untrained eye, however, it seems that both languages sort of do the same thing. So my question is if there…
BioHazZzZard
  • 319
  • 1
  • 4
  • 5
56
votes
13 answers

Software for drawing bayesian networks (graphical models)

I am searching for [free] software that can produce nice looking graphical models, e.g. Any suggestions would be appreciated.
C. Reed
  • 537
  • 1
  • 8
  • 14
52
votes
8 answers

Excel as a statistics workbench

It seems that lots of people (including me) like to do exploratory data analysis in Excel. Some limitations, such as the number of rows allowed in a spreadsheet, are a pain but in most cases don't make it impossible to use Excel to play around with…
Carlos Accioly
  • 4,715
  • 4
  • 25
  • 28
51
votes
9 answers

Does anyone know any good open source software for visualizing data from database?

Recently I came across Tableau and tried to visualize the data from database and csv file. The user iterface enables the user to visualize time and spatial data and create plots in an instant. Such tool is really useful as it enables to observe the…
niko
  • 1,261
  • 3
  • 15
  • 18
44
votes
4 answers

OpenBugs vs. JAGS

I am about to try out a BUGS style environment for estimating Bayesian models. Are there any important advantages to consider in choosing between OpenBugs or JAGS? Is one likely to replace the other in the foreseeable future? I will be using the…
DanB
  • 898
  • 8
  • 13
41
votes
4 answers

For plotting with R, should I learn ggplot2 or ggvis?

For plotting with R, should I learn ggplot2 or ggvis? I don't necessarily want to learn both if one of them is superior in any regard. Why R community keeps creating new packages with overlapping functionalities? The introduction blog post does…
qazwsx
  • 707
  • 2
  • 7
  • 10
28
votes
7 answers

Why and when create a R package?

I understand this question is quite a broad one, but I wonder what should be the decisive points in deciding to create (or not) a new package for R. To be more specific, I would add that the question is not about the reasons to use R in itself, more…
27
votes
9 answers

Statistics and data mining software tools for dealing with large datasets

Currently I have to analyze approximately 20M records and build prediction models. So far I have tried out Statistica, SPSS, RapidMiner and R. Among these Statistica seems to be most suitable to deal with data mining and RapidMiner user interface is…
niko
  • 1,261
  • 3
  • 15
  • 18
26
votes
9 answers

What's a good tool to create Sankey diagrams?

How can I create Sankey diagrams?
Thilo
  • 533
  • 1
  • 5
  • 7
23
votes
4 answers

C++ libraries for statistical computing

I've got a particular MCMC algorithm which I would like to port to C/C++. Much of the expensive computation is in C already via Cython, but I want to have the whole sampler written in a compiled language so that I can just write wrappers for…
JMS
  • 4,660
  • 1
  • 22
  • 32
20
votes
14 answers

Software for easy-yet-robust data exploration

In my attempts to fight spreadsheet mayhem, I am often evangelical in pushing for more robust tools such as true statistics software (R, Stata, and the like). Recently, I was challenged on this view by someone who stated flat out that they simply…
Ari B. Friedman
  • 3,421
  • 4
  • 33
  • 42
1
2 3
15 16