imbalance
ImbalanceGenerator
Bases: GeneratorMixin
Base class for transformers that makes tabular data imbalanced
Source code in badgers/generators/tabular_data/imbalance.py
11 12 13 14 15 16 17 18 19 20 21 22 23 24 |
|
__init__(random_generator=default_rng(seed=0))
:param random_generator: A random generator
Source code in badgers/generators/tabular_data/imbalance.py
16 17 18 19 20 |
|
RandomSamplingClassesGenerator
Bases: ImbalanceGenerator
Randomly samples data points within predefined classes
Source code in badgers/generators/tabular_data/imbalance.py
56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 |
|
__init__(random_generator=default_rng(seed=0), proportion_classes=None)
:param random_generator: A random generator :param proportion_classes: Example for having in total 50% of class 'A', 30% of class 'B', and 20% of class 'C' proportion_classes={'A':0.5, 'B':0.3, 'C':0.2}
Source code in badgers/generators/tabular_data/imbalance.py
61 62 63 64 65 66 67 68 69 70 |
|
generate(X, y, **params)
Randomly samples instances for each classes
:param X: :param y: :param params: :return:
Source code in badgers/generators/tabular_data/imbalance.py
72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 |
|
RandomSamplingFeaturesGenerator
Bases: ImbalanceGenerator
Source code in badgers/generators/tabular_data/imbalance.py
27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 |
|
__init__(random_generator=default_rng(seed=0), sampling_proba_func=lambda X: normalize_proba(X[:, 0]))
:param random_generator: A random generator :param sampling_proba_func: A function that takes as input data and returns a sampling probability
Source code in badgers/generators/tabular_data/imbalance.py
29 30 31 32 33 34 35 36 |
|
generate(X, y=None, **params)
Randomly samples instances based on the features values in X
:param X: :param y: :return: Xt, yt
Source code in badgers/generators/tabular_data/imbalance.py
38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 |
|
RandomSamplingTargetsGenerator
Bases: ImbalanceGenerator
Randomly samples data points
Source code in badgers/generators/tabular_data/imbalance.py
97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 |
|
__init__(random_generator=default_rng(seed=0), sampling_proba_func=lambda y: normalize_proba(y))
:param random_generator: A random generator :param sampling_proba_func: A function that takes y as input and returns a sampling probability
Source code in badgers/generators/tabular_data/imbalance.py
102 103 104 105 106 107 108 109 110 |
|
generate(X, y, **params)
Randomly samples instances for each classes
:param X: :param y: :return:
Source code in badgers/generators/tabular_data/imbalance.py
112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 |
|