pd_multiprocessing¶
pd_multiprocessing provides a simple, parallelized function for applying a user-defined function row-wise to a pandas DataFrame.
Requirements¶
Documentation¶
If you want to build the documentation, you need the following packages:
- Sphinx
- sphinx_rtd_theme
- m2r
Usage¶
A typical usage looks like this:
import pandas as pd
from pd_multiprocessing.map import df_map

def twotimes(row):
    row['col2'] = row['col1']*2
    return row

if __name__ == '__main__':
    df = pd.DataFrame.from_dict({'col1': range(100)})
    print(df_map(twotimes, df))
Documentation¶
For the full documentation, please see https://pd-multiprocessing.readthedocs.io/en/latest/.
Bugs/Requests¶
Please use the GitHub issue tracker to submit bugs or request features.
License¶
Copyright Kyriakos Stavrakidis, 2019.
Distributed under the terms of the MIT license, pd_multiprocessing is free and open source software.