Catalog
CERULEo dataset catalog is a collection of RUL estimation datasets ready to use. All datasets are exposed as AbstractTimeSeriesDataset
, enabling easy-to-use and transformation input pipelines. To get started see the guide and our list of datasets.
CMAPSS
CMAPSSDataset
Bases: AbstractPDMDataset
C-MAPSS Dataset
C-MAPSS stands for 'Commercial Modular Aero-Propulsion System Simulation' and it is a tool for the simulation of realistic large commercial turbofan engine data. Each flight is a combination of a series of flight conditions with a reasonable linear transition period to allow the engine to change from one flight condition to the next. The flight conditions are arranged to cover a typical ascent from sea level to 35K ft and descent back down to sea level.
The fault was injected at a given time in one of the flights and persists throughout the remaining flights, effectively increasing the age of the engine. The intent is to identify which flight and when in the flight the fault occurred.
Available models are:
- FD001
- FD002
- FD003
- FD004
Example
Parameters:
Name | Type | Description | Default |
---|---|---|---|
train |
bool
|
Weather to obtain the train data provided, by default True |
True
|
models |
Optional[Union[str, List[str]]]
|
Names of the models, by default None (all models) |
None
|
Source code in ceruleo/dataset/catalog/CMAPSS.py
obtain_raw_files(raw_data_path=DATASET_PATH)
Download and unzip the raw files
Parameters:
Name | Type | Description | Default |
---|---|---|---|
raw_data_path |
Path
|
Path where to store the dataset |
DATASET_PATH
|
Source code in ceruleo/dataset/catalog/CMAPSS.py
CMAPSS-2
CMAPSS2Dataset
Bases: AbstractPDMDataset
C-MAPSS-2 Dataset
The dataset provides a new realistic dataset of run-to-failure trajectories for a small fleet of aircraft engines under realistic flight conditions.
The damage propagation modelling used for the generation of this synthetic dataset builds on the modeling strategy from previous work . The dataset was generated with the Commercial Modular Aero-Propulsion System Simulation (C-MAPSS) dynamical model. The data set is been provided by the Prognostics CoE at NASA Ames in collaboration with ETH Zurich and PARC.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
train |
Optional[bool]
|
Wether to obtain the train data provided |
None
|
Source code in ceruleo/dataset/catalog/CMAPSS2.py
PHMDataset2018
FailureType
Bases: Enum
Failure types availables for the dataset.
Possible values are:
FailureType.FlowCoolPressureDroppedBelowLimit
FailureType.FlowcoolPressureTooHighCheckFlowcoolPump
FailureType.FlowcoolLeak
Source code in ceruleo/dataset/catalog/PHMDataset2018.py
PHMDataset2018
Bases: PDMDataset
PHM 2018 Dataset
The 2018 PHM dataset is a public dataset released by Seagate which contains the execution of 20 different ion milling machines. They distinguish three different failure causes and provide 22 features, including user-defined variables and sensors.
Three faults are present in the dataset
- Fault mode 1 occurs when flow-cool pressure drops.
- Fault mode 2 occurs when flow-cool pressure becomes too high.
- Fault mode 3 represents flow-cool leakage.
Example:
dataset = PHMDataset2018(
failure_types=FailureType.FlowCoolPressureDroppedBelowLimit,
tools=['01_M02']
)
Parameters:
Name | Type | Description | Default |
---|---|---|---|
path |
Path
|
Path where the dataset is located |
DATA_PATH
|
Source code in ceruleo/dataset/catalog/PHMDataset2018.py
66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 |
|
prepare_raw_dataset()
Download and unzip the raw files
Parameters:
Name | Type | Description | Default |
---|---|---|---|
path |
Path
|
Path where to store the raw dataset |
required |