Early detection of anomalies, trends, and emerging patterns can be exploited to reduce the number and severity of quality problems in vehicles. This is crucial because a good understanding of product quality leads to better designs in the future and to better maintenance for resolving current issues. To this end, the large amounts of data logged during vehicle operation can be integrated to build a model of usage patterns for early prediction. In this study, we have developed a machine learning system for predicting warranty claims by integrating two available information sources: Logged Vehicle Data (LVD) and Warranty Claims (WCs). Experimental results obtained from a large data set of heavy-duty trucks demonstrate the effectiveness of the proposed system in predicting warranty claims. © Springer Nature Switzerland AG 2019.