Return value from dataframe with multiple criteria

alvisv · October 8, 2022, 5:59pm

I'm new to Python, and my main goal was to move my calculations from Excel to pandas DataFrames. My concern is that this line of code takes too long to execute:

TRX["group_code"] = np.dot(
    (TRX["lookup"].values[:, None] == bcodes["ID_lookup"].values)
    & (TRX["completed_at / operation_completed_at"].values[:, None] >= bcodes["Effective from"].values)
    & (TRX["completed_at / operation_completed_at"].values[:, None] <= bcodes["Effective till"].values),
    bcodes["Code"],
)

TRX is a DataFrame with 298515 rows × 44 columns, and bcodes is a DataFrame with 6960 rows × 41 columns.

I timed this line of code: 1min 43s ± 0 ns per loop (mean ± std. dev. of 1 run, 1 loop each)

Even when I left only one lookup criterion (without the dates), it still takes 1 min 37 s.

I'm pretty sure there should be a better/faster way to get the result.

tjreedy · October 10, 2022, 5:56am

You might have better luck asking questions about 3rd party modules on stackoverflow.com.

alvisv · October 10, 2022, 7:14am

Thanks, I already found a better solution: merge the DataFrames on lookup and then filter the result by the date criteria.
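
For readers landing here, the merge-then-filter idea might look roughly like the sketch below. The column names are taken from the original post, but the toy data is made up, and summing the matching codes (with 0 for no match) is only an assumption chosen to mirror what the np.dot line returns; it is not necessarily what was actually used. The merge only builds rows for keys that match, instead of comparing all 298515 × 6960 (roughly 2 billion) pairs.

```python
import pandas as pd

# Toy stand-ins for the TRX and bcodes frames from the post (values are invented).
TRX = pd.DataFrame({
    "lookup": ["A", "A", "B", "C"],
    "completed_at / operation_completed_at": pd.to_datetime(
        ["2022-01-15", "2022-03-10", "2022-02-01", "2022-02-01"]
    ),
})
bcodes = pd.DataFrame({
    "ID_lookup": ["A", "A", "B"],
    "Effective from": pd.to_datetime(["2022-01-01", "2022-03-01", "2022-01-01"]),
    "Effective till": pd.to_datetime(["2022-02-28", "2022-12-31", "2022-01-31"]),
    "Code": [10, 20, 30],
})

date_col = "completed_at / operation_completed_at"

# Step 1: merge on the lookup key. reset_index() keeps the original TRX row
# label in a column named "index" so results can be mapped back afterwards.
candidates = TRX.reset_index().merge(
    bcodes[["ID_lookup", "Effective from", "Effective till", "Code"]],
    left_on="lookup",
    right_on="ID_lookup",
    how="inner",
)

# Step 2: keep only candidate pairs whose date falls inside the validity
# window (both bounds inclusive, like the >= and <= in the original line).
in_range = candidates[date_col].between(
    candidates["Effective from"], candidates["Effective till"]
)
matched = candidates.loc[in_range]

# Step 3: map codes back to TRX rows. Summing per row and filling 0 for rows
# without a match is an assumption that imitates the np.dot behaviour.
TRX["group_code"] = (
    matched.groupby("index")["Code"].sum().reindex(TRX.index, fill_value=0)
)

print(TRX[["lookup", "group_code"]])
```

If every transaction is expected to match at most one validity window, the groupby sum could be replaced by a check that "index" has no duplicates in matched, which would surface overlapping windows instead of silently adding their codes.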
