extend_by_non_matched

extend_by_non_matched#

powerplantmatching.heuristics.extend_by_non_matched(df, extend_by, label=None, query=None, aggregate_added_data=True, config=None, **aggkwargs)#

Returns the matched dataframe with additional entries of non-matched powerplants of a reliable source.

Parameters:
  • df (Pandas.DataFrame) – Already matched dataset which should be extended

  • extend_by (pd.DataFrame | str) – Database which is partially included in the matched dataset, but which should be included totally. If str is passed, is will be used to call the corresponding data from data.py

  • label (str) – Column name of the additional database within the matched dataset, this string is used if the columns of the additional database do not correspond to the ones of the dataset