Development of an Automatic Pipeline for Data Analysis and Pre-Processing for Data Driven-Based Engine Emission Modeling in a Real Industrial Application

Boris Petrone; Emanuele Giovannardi; Alessandro Brusa; Nicolò Cavina; Ioannis Kitsopanidis

doi:10.4271/2024-01-2018

Features

Event: WCX SAE World Congress Experience

Authors

Boris Petrone

University of Bologna

Emanuele Giovannardi

University of Bologna

Alessandro Brusa

University of Bologna

Nicolò Cavina

University of Bologna

Ioannis Kitsopanidis

Ferrari Spa

Abstract

Content: During the development of an Internal Combustion Engine-based powertrain, traditional procedures for calibrating and validating control strategies produce a huge amount of data, which can be used to develop innovative data-driven applications such as emission virtual sensing. One of the main challenges is data quality, which cannot be easily assessed for such a large amount of data. This work focuses on an emission modeling activity that uses an enhanced Light Gradient Boosting Regressor and a dedicated data pre-processing pipeline to improve data quality. First, a software tool is developed to access a database containing data from emissions tests. The tool performs a data cleaning procedure to exclude corrupted data or invalid parts of a test. Moreover, it automatically tunes the model hyperparameters, chooses the best set of features, and validates the procedure by comparing the estimate with the experimental measurement. The proposed pre-processing pipeline improves accuracy, demonstrating the usefulness of large training datasets that cover a wide set of vehicle maneuvers. Accordingly, custom-designed tests are performed for dataset enrichment, allowing the model to predict non-conventional conditions of aftertreatment system inefficiency. Real-case applications of the proposed model are presented, such as emission estimation in non-measurable conditions, virtual assessment of the impact of a new control strategy calibration on emissions, and alignment of emission measurements with all other vehicle signals. Finally, a Principal Component Analysis-based algorithm is developed to assess the epistemic uncertainty of the model and the reliability of its predictions during inference.
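The abstract does not describe how the Principal Component Analysis-based reliability check works internally. As an illustration only, one common way to flag inputs that fall outside the training distribution is the PCA reconstruction error: fit the principal components on the training features, then treat a large residual at inference time as a sign that the prediction may be unreliable. The sketch below assumes this approach; the function names, the synthetic data, and the 99th-percentile threshold are all hypothetical and are not taken from the paper.

```python
import numpy as np


def fit_pca(X, n_components):
    """Fit PCA on standardized features via SVD (illustrative helper)."""
    mean = X.mean(axis=0)
    std = X.std(axis=0)
    std[std == 0] = 1.0  # avoid division by zero for constant features
    Z = (X - mean) / std
    # Rows of vt are the principal directions, ordered by explained variance.
    _, _, vt = np.linalg.svd(Z, full_matrices=False)
    return mean, std, vt[:n_components]


def reconstruction_error(X, mean, std, components):
    """Euclidean distance between each standardized sample and its
    projection onto the retained principal subspace."""
    Z = (X - mean) / std
    Z_hat = (Z @ components.T) @ components
    return np.sqrt(((Z - Z_hat) ** 2).sum(axis=1))


# Synthetic stand-in for training features: 3 signals lying near a 2-D plane.
rng = np.random.default_rng(0)
mixing = np.array([[1.0, 0.5, -0.3],
                   [0.2, -1.0, 0.8]])
latent = rng.normal(size=(500, 2))
X_train = latent @ mixing + 0.01 * rng.normal(size=(500, 3))

mean, std, comps = fit_pca(X_train, n_components=2)
train_err = reconstruction_error(X_train, mean, std, comps)

# Assumed rule: anything beyond the 99th percentile of training error is flagged.
threshold = np.percentile(train_err, 99)

# A sample on the training plane and one pushed off it along the plane normal.
x_in = np.array([[0.5, -0.5]]) @ mixing
normal = np.cross(mixing[0], mixing[1])
x_out = x_in + 2.0 * normal

err_in = reconstruction_error(x_in, mean, std, comps)[0]
err_out = reconstruction_error(x_out, mean, std, comps)[0]

print("in-distribution sample flagged:", err_in > threshold)
print("off-distribution sample flagged:", err_out > threshold)
```

In a setup like this, the flag would be reported alongside the regressor's emission estimate, so that estimates for operating conditions unlike anything in the training data can be treated with caution.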

Meta Tags

Topics: Exhaust emissions
Emissions measurement
Environmental testing
Mathematical models
Simulation and modeling
Emissions
Combustion and combustion processes
Calibration
Measurements
Education and training

Affiliated or Co-Author: University of Bologna
Ferrari Spa

Details

DOI: https://doi.org/10.4271/2024-01-2018

Citation: Petrone, B., Giovannardi, E., Brusa, A., Cavina, N., et al., "Development of an Automatic Pipeline for Data Analysis and Pre-Processing for Data Driven-Based Engine Emission Modeling in a Real Industrial Application," WCX SAE World Congress Experience, Detroit, Michigan, United States, April 16, 2024, https://doi.org/10.4271/2024-01-2018.

Additional Details

Publisher: SAE International

Published: 4/9/2024

Product Code: 2024-01-2018

Content Type: Technical Paper

Language: English