User:Josefpktd/Statsmodels

Statsmodels ...

scikits.statsmodels is a Python package for statistical analysis and econometrics. It is based on Numpy and Scipy and can be used as a library or for interactive scripting. It covers many basic statistical models for estimation, descriptive statistics and statistical tests. Some basic models that we would expect in a general statistical package are still missing.

The origin of statsmodels is a python package originally written by Jonathan Taylor, which was for some time included in scipy. During Google Summer of Code 2009, Skipper Seabold and Josef Perktold corrected, tested and enhanced stats.models and released it as a new package. Since then, the coverage of statistical and econometric methods has been expanded, the 0.3 release saw the addition of basic time series models, with Wes McKinney as major new contributor,.

Features
The latest release includes the following features


 * Linear models: Generalized least squares (including weighted least squares and least squares with autoregressive errors), ordinary least squares.
 * Generalized linear models with support for all of the one-parameter exponential family distributions.
 * Discrete choice models such as Poisson, Probit, Logit, Multinomial Logit based on maximum likelihood estimation.
 * Robust linear models with support for several M-estimators.
 * Time series analysis: includes descriptive statistics, autoregressive models (AR), autoregressive moving average models (ARMA) and vector-autoregressive models (VAR)
 * (Univariate) kernel density estimators
 * some datasets that are used for examples and in testing.
 * PyDTA: Tools for reading Stata .dta files into numpy arrays.
 * statistical tests and regression diagnostic tests