extend_by_non_matched
- powerplantmatching.heuristics.extend_by_non_matched(df, extend_by, label=None, query=None, aggregate_added_data=True, config=None, **aggkwargs)
Returns the matched DataFrame extended with the power plants from a reliable source that were not matched.
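The idea can be illustrated with a minimal sketch in plain pandas. This is not the library's implementation: the helper `append_non_matched`, the toy data, and the column layout (a `projectID` column in the source and a label column `SRC` holding that ID in the matched data) are assumptions made for illustration only.

```python
import pandas as pd


def append_non_matched(matched, source, label, id_col="projectID"):
    """Hypothetical helper: append rows of `source` whose IDs do not
    yet appear in the `label` column of `matched`."""
    # IDs from the source that are already part of the matched dataset
    known_ids = set(matched[label].dropna())
    # Rows of the source that were never matched
    missing = source[~source[id_col].isin(known_ids)]
    # Store the source ID under the label column so origins stay traceable
    missing = missing.rename(columns={id_col: label})
    return pd.concat([matched, missing], ignore_index=True)


# Toy matched dataset: "Alpha" was matched to source entry "s1"
matched = pd.DataFrame(
    {"Name": ["Alpha", "Beta"], "Capacity": [100.0, 250.0], "SRC": ["s1", None]}
)
# Toy reliable source: "s2" is not yet part of the matched dataset
source = pd.DataFrame(
    {"projectID": ["s1", "s2"], "Name": ["Alpha", "Gamma"], "Capacity": [100.0, 40.0]}
)

extended = append_non_matched(matched, source, label="SRC")
print(extended)
```

In this toy case only the "Gamma" row is appended, because "s1" is already referenced in the matched data. The real function additionally supports filtering and aggregation of the added rows through the arguments in its signature, which this sketch does not model.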
- Parameters:
df (pandas.DataFrame) – Already matched dataset that should be extended
extend_by (pd.DataFrame | str) – Database that is only partially included in the matched dataset but should be included in full. If a str is passed, it is used to call the corresponding data from data.py
label (str) – Column name of the additional database within the matched dataset. This string is used if the columns of the additional database do not correspond to those of the dataset