
pd_multiprocessing

pd_multiprocessing provides a simple, parallelized function that applies a user-defined function row-wise to a pandas DataFrame.

Requirements

Documentation

If you want to build the documentation, you need the following packages:

  • Sphinx
  • sphinx_rtd_theme
  • m2r

Installation

You can install pd_multiprocessing from PyPI with pip:

pip install pd-multiprocessing

Usage

Typical usage looks like this:

import pandas as pd
from pd_multiprocessing.map import df_map


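def twotimes(row):
    # Called once per row: add 'col2' holding twice the value of 'col1'.
    row['col2'] = row['col1']*2
    return row


# The guard keeps worker processes from re-running this block on import.
if __name__ == '__main__':
    df = pd.DataFrame.from_dict({'col1': range(100)})
    print(df_map(twotimes, df))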
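For comparison, the same row function can be run sequentially with plain pandas. The sketch below assumes that `df_map` is meant to produce the same frame as a sequential row-wise `DataFrame.apply`; that equivalence is an assumption, not something stated above. The `row.copy()` line is an addition here so that the function never modifies the row object pandas passes in.

```python
import pandas as pd


def twotimes(row):
    # Work on a copy so the row object handed in by pandas is left untouched.
    row = row.copy()
    row['col2'] = row['col1'] * 2
    return row


df = pd.DataFrame({'col1': range(5)})

# axis=1 passes each row to twotimes as a pandas Series.
sequential = df.apply(twotimes, axis=1)
print(sequential)
```

A sequential run like this can serve as a reference when checking a parallel result on a small frame.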

Documentation

For the full documentation, please see https://pd-multiprocessing.readthedocs.io/en/latest/.

Bugs/Requests

Please use the GitHub issue tracker to report bugs or request features.

Changelog

Consult the Changelog page for the fixes and enhancements in each version.

License

Copyright Kyriakos Stavrakidis, 2019.

Distributed under the terms of the MIT license, pd_multiprocessing is free and open source software.