Copper

Wrapper around python data analysis packages such as pandas, scikit-learn and matplotlib to make data analysis on python easier.

Requirements

Python
pandas
scikit-learn
matplotlib
rpy2
tornado

Note: pandas is the only package that is required before installing copper, but is recommended to have all other packages installed too.

Note 2: The package is developed for Python 3 and Python 2 with a single code base. But the main target is Python 3 so is recommended since most packages already support Python 3.

Install

pip install copper

Features

Project structure for Data Analysis projects ala Project Template on R.
Dataset: Wrapper around pandas.DataFrame to introduce metadata
Data transformation templates
Custom matplotlib charts for exploration: histograms, scatterplots
Exploration via D3.js (very experimental)
More data imputation options via R (rpy2)
Rapid Machine Learning prototyping:
- Easy to compare classifiers
- Ensemble (bagging)

Project Structure

Copper uses a project structure based on Project Template (from R) to give structure to a Data Analysis project.

The suggested structure is:

data -> `project/data': All the data files, raw, cached, etc.

Is suggested to use /data/raw for raw files such as .csv files.

Copper by default loads data from the data folder. For example: data = copper.read_csv('catalog.csv') will load the project/data/catalog.csv file into a pandas.DataFrame using the pandas read_csv method and parameters.

As expected when saving files (copper.save(...) or copper.export(...)) copper saves the files on the data folder

source -> src: Python, iPython notebook files.

Following the intuition every file inside the source folder should do:

import copper
copper.project.path = '../'

For other suggested folders see: Project Template

For more info about this see the examples below.

Examples

Donors:

Project structure and histograms

Loans:

Automatic data transformation

Catalog:

Custom transformation and basic machine learning

Kaggle Bulldozers:

Post: Basic feature selection and join Datasets

For more information and more examples (but some are possible outdated) can see my blog: danielfrg.github.com

Name		Name	Last commit message	Last commit date
Latest commit History 105 Commits
copper		copper
dist		dist
.gitignore		.gitignore
CHANGES.txt		CHANGES.txt
LICENSE.txt		LICENSE.txt
MANIFEST		MANIFEST
README.md		README.md
README.txt		README.txt
copper.sublime-project		copper.sublime-project
copper.sublime-workspace		copper.sublime-workspace
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Copper

Requirements

Install

Features

Project Structure

Examples

About

Releases

Packages

License

hnfgns/copper

Folders and files

Latest commit

History

Repository files navigation

Copper

Requirements

Install

Features

Project Structure

Examples

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Packages