Augustus
Augustus is an open source system for building and scoring statistical models, designed to work with data sets that are too large to fit into memory. Augustus is now available under the Apache Software License. Older versions will remain under the GNU GPL v2 open source license.

Quick Links
To get started, we offer an Installation Guide and a Modeling Primer, which covers the Augustus examples.

0.5.2.0 Release
Augustus 0.5.2.0 is now available. The source can be checked out at tags/augustus-0.5.2.0. This release includes Naive-Bayes and Regression models as well as the previously released Tree, Baseline, Cluster, and Ruleset models. Tree and Cluster models have been improved. "Custom processing", a way to use Augustus features with fewer constraints, is also included.

Augustus 0.5.0.0 represents a substantial change for Augustus. Please refer to the Augustus 0.5 Overview for more information about the release. There are two examples, gaslog and email, under augustus-examples, which use the updated configuration and demonstrate some of the new features.

PMML
Predictive Model Markup Language (PMML) is an XML markup language for describing statistical and data mining models. PMML describes the inputs to data mining models, the transformations used to prepare data for data mining, and the parameters which define the models themselves. It is used for a wide variety of applications, including applications in finance, e-business, direct marketing, manufacturing, and defense. PMML is often used so that systems which create statistical and data mining models ("PMML Producers") can easily interoperate with systems which deploy PMML models for scoring or other operational purposes ("PMML Consumers").

Open Data
Open Data Group specializes in building predictive models over big data and is one of the pioneers using technologies such as Hadoop and NoSQL databases so that companies can build predictive models efficiently over all of their data. ODG provides management consulting services, outsourced analytical services, analytic staffing, and expert witnesses broadly related to data and analytics. It has experience with customer data, supplier data, financial and trading data, and data from internal business processes. It has staff in Chicago and clients throughout the U.S. Open Data Group began operations in 2002.

Open Data uses and contributes to open source software whenever it can.
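
As a rough illustration of the PMML description above (inputs plus the parameters that define a model), the following sketch plays the role of a very small "PMML Consumer" using only the Python standard library. It does not use Augustus or its API. The PMML document, the field names x and y, the coefficients, and the helper functions describe_fields and score are all hypothetical and exist only for this example.

```python
import xml.etree.ElementTree as ET

# Hypothetical minimal PMML document: field names and coefficients are made up.
PMML_TEXT = """<PMML version="4.1" xmlns="http://www.dmg.org/PMML-4_1">
  <Header description="illustrative example"/>
  <DataDictionary numberOfFields="2">
    <DataField name="x" optype="continuous" dataType="double"/>
    <DataField name="y" optype="continuous" dataType="double"/>
  </DataDictionary>
  <RegressionModel functionName="regression">
    <MiningSchema>
      <MiningField name="x"/>
      <MiningField name="y" usageType="predicted"/>
    </MiningSchema>
    <RegressionTable intercept="1.0">
      <NumericPredictor name="x" coefficient="2.0"/>
    </RegressionTable>
  </RegressionModel>
</PMML>"""

# Namespace prefix map for the PMML 4.1 namespace declared in the document above.
NS = {"p": "http://www.dmg.org/PMML-4_1"}


def describe_fields(root):
    """Return (name, dataType) pairs for every field in the DataDictionary."""
    return [
        (field.get("name"), field.get("dataType"))
        for field in root.findall("p:DataDictionary/p:DataField", NS)
    ]


def score(root, record):
    """Evaluate the linear regression table: intercept + sum(coefficient * value)."""
    table = root.find("p:RegressionModel/p:RegressionTable", NS)
    total = float(table.get("intercept"))
    for predictor in table.findall("p:NumericPredictor", NS):
        total += float(predictor.get("coefficient")) * record[predictor.get("name")]
    return total


root = ET.fromstring(PMML_TEXT)
fields = describe_fields(root)        # the model's declared inputs and output
prediction = score(root, {"x": 3.0})  # 1.0 + 2.0 * 3.0
print(fields, prediction)
```

A real consumer such as Augustus handles far more than this sketch (transformations, other model types, streaming over large data sets); the point here is only that the model's inputs and parameters are carried in the XML itself, so a producer and a consumer need to share nothing but the PMML file.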




