Driving Style Recognition at First Impression for Online Trajectory Prediction
Abstract
This paper proposes a new driving style recognition approach that allows autonomous vehicles (AVs) to perform trajectory predictions for surrounding vehicles with minimal data. Toward that end, we use a hybrid of offline and online methods in the proposed approach. We first learn typical driving styles with PCA and K-means algorithms in the offline part. After that, local Maximum-Likelihood techniques are used to perform online driving style recognition. We benchmarked our method on a real driving dataset against other methods in terms of the RMSE value of the predicted trajectory and the observed trajectory over a 5s duration. The proposed approach can reduce trajectory prediction error by up to 37.7% compared to using the parameters from other literature and up to 24.4% compared to not performing driving style recognition.
keywords:
Modeling and simulation of transportation systems, Trajectory prediction, Driving style, Car-following, Maximum Likelihood1 Introduction
Predicting the trajectory of surrounding traffic participants is a critical predecessor task for autonomous driving motion planning. For safety purposes, trajectory prediction needs to be precise and computationally efficiently (Leon and Gavrilescu, 2021), making this task difficult. Vehicles’ driving behavior is highly uncertain, and it could be impacted by various observable and unobservable factors, including traffic signs, road environment, and drivers’ psychological and physiological factors, which can be referred to as “driving style”. An experienced driver would form first impressions of surrounding vehicles’ driving style, usually after a short period of observation, even at a glance, and the driving style recognition helps the drivers to categorize their prediction on neighbor vehicles’ trajectory to help do their motion planning.
Driving style is a complex concept (Taubman-Ben-Ari et al., 2004; Ericsson, 2000). It is a common approach to study driving style from speed profile by interpreting driving activities such as acceleration, deceleration, turning and lane changing (Martinez et al., 2017). Common driving style recognition methods include rule-based algorithms to allocate driving behavior into groups based on predefined thresholds (Murphey et al., 2009), fuzzy-logic regression (Guo et al., 2019), unsupervised machine learning algorithms such as principal component analysis (PCA) (Constantinescu et al., 2010) and K-means clustering (Gao et al., 2020), supervised machine learning algorithms such as k-nearest neighbors (kNN) (Vaitkus et al., 2014), support vector machine (SVM) (Xue et al., 2019) and artificial neural networks (ANN) (Xu et al., 2015).
In terms of trajectory prediction, there are generally two types of approaches, including data-driven and rule-based methods. Data-driven methods such as Gaussian mixture models and neural networks (Sheng et al., 2022; Hou et al., 2020; Cui et al., 2019) often have the expressive capability to capture the subtle differences among various driving styles. However, the data-driven approaches are like black-box and lack interpretability, and they could sometimes make unrealistic predictions and threaten ego vehicle’s driving safety (Bhattacharyya et al., 2022). Moreover, data-driven methods are usually more computing-intensive, requiring more input data and longer computing time than rule-based methods. For autonomous driving under single vehicle intelligence (in contrast to connected vehicles, which can get beyond-visual-range information through vehicle-to-vehicle or vehicle-to-road communications), on-board sensors can hardly observe the exact neighboring vehicle for a long period, except the leading vehicle in adaptive cruise control mode. Thus, AVs must make online trajectory predictions with a minimal amount of observation data.
Rule-based methods apply driver models, e.g., the IDM (Treiber et al., 2000), the Gipps’ Model (Gipps, 1981), and the Newell’s Model (Newell, 2002), to fit observed car-following and lane-changing behaviors and to make trajectory prediction. Since rule-based models take advantage of expert knowledge, they can make relatively robust trajectory predictions even with underrepresented data (Bhattacharyya et al., 2020). The precision of rule-based models relies on the parameter estimation of driver models. Model parameters can be chosen either offline or online. Offline approaches can use a relatively larger amount of data to generate a set of average parameters to portray typical driving style, and there is no computing time constraint. Online approaches use real-time observation data to calibrate model parameters better to fit individual characteristics (Monteil et al., 2015). According to a survey paper (Punzo et al., 2021), only a small fraction of research attempted to calibrate car-following models.
This paper proposes a new approach as a hybrid of offline and online methods, allowing us to perform trajectory prediction with limited observation data. Our approach can take advantage of an arbitrary amount of historical data to learn typical driving styles so that it can form a first impression of a newly observed vehicle’s driving style with a significantly shorter observation period, just like an experienced driver. Moreover, it will dynamically correct its driving style recognition and accumulate sensor data about the exact surrounding vehicle to perform a precise trajectory prediction at relatively low computing costs.
2 Methodology
The proposed trajectory prediction approach consists of offline and online parts. For the offline part, an arbitrary amount of historical trajectory data are used to learn typical driving styles. Then, the Intelligent Driving Model (IDM) car-following parameters for each driving style are estimated. The clustering analysis results and the IDM parameter value sets help AVs to form a first impression of a newly observed vehicle’s driving style with a significantly shorter observation period in the online part. For the online part, given the trajectory data of a surrounding vehicle , one can find the IDM parameter value set that best fits the trajectory. The driver is then classified into the corresponding driving style cluster . One thing to note is that, along with the accumulation of trajectory data of vehicle , the approach can dynamically correct its driving style recognition. The framework of the proposed approach is shown in Fig. 1.

3 Experiments
3.1 Data
We evaluate the performance of our driving style recognition methods on demonstration data from a real-world dataset, namely the reconstructed I80-1 dataset (from 4.00 p.m. to 4.15 p.m.) (Montanino and Punzo, 2015; Punzo et al., 2011), which provides the X, and Y coordinates of each vehicle at 10Hz. In this paper, we perform driving style recognition only based on car-following features of vehicles. Toward that end, car-following pairs (leader-follower) were extracted from the original dataset. In each car-following pair, the car-following driving condition between the leader and the follower lasts for at least 15 seconds without any lane changing or interference from other vehicles. We extracted 833 car-following pairs from the inner four lanes in total. We use 80% of them (666 pairs) in the offline part (referred to as dataset 1 hereafter) and 20% of them (167 pairs) in the online part (referred to as dataset 2 hereafter).
3.2 The offline part: typical driving styles learning
We need ego vehicle profiles such as speed and acceleration to identify the driving style of a single vehicle. On the other hand, we need to consider the interaction between the ego vehicle and its leader. We select 13 features as characteristic indexes for vehicle driving style learning, as shown in Table 1.
No. | Characteristic indicators |
---|---|
Maximum speed (m/s) | |
Mean speed (m/s) | |
Standard deviation of speed (m/s) | |
Maximum acceleration (m/s2) | |
Minimum acceleration (m/s2) | |
Mean acceleration (m/s2) | |
Standard deviation of acceleration (m/s2) | |
Maximum gap(m) | |
Minimum gap (m) | |
Average gap (m) | |
Standard deviation of the gap (m) | |
Average speed difference (m/s) | |
Standard deviation of the speed difference (m/s) |
We extracted the above 13 features from 15 seconds of car-following data in each car-following pair from dataset 1. After the feature selection, dimension reduction is carried out on the 13 selected feature indexes such that the low-dimensional representation retains some meaningful properties of the original data. This paper uses Principal Component Analysis (PCA) (Abdi and Williams, 2010) as the dimension reduction method. PCA performs a linear mapping of the data to a lower-dimensional space in such a way that the variance of the data in the low-dimensional representation is maximized. PCA calculation was performed on the trajectory data, and the explained variance ratio by the first five principal components can be found in Table 2. As seen in the table, selecting the first two principal components could represent most of the information (greater than 90%) about vehicle car-following behaviors.
Principal Component | Explained Variance Ratio | Accumulated |
PC1 | 0.862 | 0.862 |
PC2 | 0.083 | 0.945 |
PC3 | 0.042 | 0.987 |
PC4 | 0.004 | 0.991 |
PC5 | 0.003 | 0.994 |
After the dimension reduction process, the K-means algorithm was used to classify typical vehicle driving styles(Krishna and Narasimha Murty, 1999). Firstly, To determine the optimal number of clusters, we have to select the value of at the “elbow,” which is the point after which the Error Sum of Squares (SSE) within the cluster starts decreasing linearly. According to Fig. 2, was selected as the optimal cluster number in this paper. The obtained clustering results are shown in Fig. 3 and Table 3. Physical meanings of PC1 and PC2 are obscure, but we will explain how we label driving styles (“relatively aggressive,” “neutral,” and “timid”) for each driving style cluster in the following subsection.


Cluster Index | Driving Style | Sample Size | Proportion |
---|---|---|---|
0 | neutral | 209 | 31.4% |
1 | relatively aggressive | 409 | 61.4% |
2 | timid | 48 | 7.2% |
3.3 The offline part: car-following parameter estimation for typical driving styles
To label clusters with driving styles, many methods in the literature (Wen et al., 2022; Xue et al., 2019) calculate the mean of features for each cluster. However, in this way, these works cannot make trajectory predictions based on the clustering results.
To fill this gap, this paper estimate the car-following parameter values for each driving style based on the Intelligent Driver Model (IDM) (Treiber et al., 2000). The form of the IDM acceleration is:
(1) |
where ( in this paper) is the free-drive exponent, is the current speed of the follower, is the current speed of the leader, is the current gap between the follower and its leader, is the desired gap, and other parameters are to be estimated based on the trajectory data, as seen in Table 4:
Parameter Name | Physical Meaning | Unit |
---|---|---|
desired velocity | m/s | |
desired time headway | s | |
minimum spacing | m | |
maximum vehicle acceleration | m/s2 | |
comfortable braking deceleration | m/s2 |
For each vehicle, we use the simulation of the IDM model to make trajectory predictions over a 5s duration, which then are compared with the observed trajectories by formulating objective functions in terms of Root Mean Square Error (RMSE):
(2) |
where is the error between the predicted trajectory and the observed trajectory at time , and the value is a function of IDM parameters . The calculation method is shown in Fig. 4.

We can calculate the average RMSE for vehicles in the single driving style cluster. Minimizing reduces the parameter estimation problem to a multivariate nonlinear optimization problem. The parameter estimation results for three driving style clusters are shown in Table 5. The parameter estimation result, , is also calculated if all vehicles are classified into only one cluster.
Cluster Index | |||||
---|---|---|---|---|---|
0 | 34.7 | 1.0 | 2.9 | 0.5 | 1.5 |
1 | 35.0 | 1.0 | 0.1 | 0.4 | 1.5 |
2 | 18.5 | 1.9 | 4.5 | 0.4 | 1.4 |
aggregate | 19.0 | 1.0 | 0.3 | 0.4 | 1.4 |
From the IDM parameters, we can observe that vehicles in“Cluster 2” show a timid driving style, interpreted by small , large , and big . Vehicles in “Cluster 1” and “Cluster 0” have similar IDM parameters but those in “Cluster 1” show relatively aggressive driving behaviors, shown by the smaller value. Based on the analysis, we label the driving style of “Cluster 0” as “neutral,” “Cluster 1” as “relatively aggressive,” and “Cluster 2” as “timid.”
3.4 The online part: fast driving style recognition
This subsection presents two methods to recognize a vehicle’s driving style with limited trajectory data. The effects of the proposed methods will be compared against benchmark models in terms of the RMSE value of the predicted trajectory and the observed trajectory over a 5s duration.
3.4.1 method 1
The idea of method 1 is to repeat the driving style learning process in the offline part. We calculate the same features listed in Table 1 from the online vehicle trajectories. Then the PCA process used in the offline part is applied to reduce the dimension of the feature data. After that, we calculate the distances between this vehicle’s first two principal components (PC1, PC2) and the three cluster centers. The vehicle belongs to the closest cluster center and has the corresponding driving style.
We test method 1 with dataset 2. For each car-following pair in the dataset, trajectory x from to is selected. not only represents the vehicle’s location but includes all information needed for extracting the car-following features. If the driving style of vehicle is recognized as , the IDM parameter set will be used to make predictions for vehicle . The results show that “driving style recognition method 1” can reduce the prediction error by up to 19.7% (2.28m to 1.83m) compared to simply using parameter values recommended in the literature (Punzo et al., 2015), and up to 2.7% (1.88m to 1.83m) compared to use the estimated aggregated parameters . However, method 1 requires a relatively long trajectory to perform well, as shown in Fig. 5. A minimum of 2 seconds of trajectory is required for cluster method 1 to outperform making trajectory predictions with aggregated estimated parameter values .

3.4.2 method 2
Since AVs are required to make online trajectory predictions with minimal observation data, we propose the second clustering method based on local Maximum-Likelihood calibration (Treiber and Kesting, 2013). In this local method, the acceleration generated by the IDM model is compared against the observed data , separately for each data point, through maximum-likelihood techniques (Serge Hoogendoorn, 2010). Toward that end, we assume the noise of the IDM model, , follows a Normal distribution with a standard deviation of ,
(3) |
such that for each vehicle, we can calculate the log-likelihood function as the sum of each time step ,
(4) |
where represents the total data points for driving style recognition from a single vehicle. The sampling frequency of the I-80 data is 10Hz, so if we have 2 seconds of trajectory data, we will have data points for driving style recognition. represents the trajectory data used in equation 4. not only represents the vehicle’s location but includes all exogenous variables needed for the IDM model at time .
In traditional vehicle parameter estimation methods, the parameter values for each vehicle are found by maximizing the log-likelihood function . However, the method in the literature is computing-intensive, which is not ideal for online trajectory prediction. In this paper, we already have three prototype parameter sets: for cluster 0, for cluster 1, and for cluster 2, which simplify the parameter estimation problem to finding the vehicle prototype parameter set that maximizes the log-likelihood:
(5) |
Method 2 is tested with dataset 2, with different values of and different trajectory duration. For each car-following pair, we select trajectory for driving style recognition. We run experiments to obtain the relationship between RMSE and , with three different values of . As seen in Fig 6, this driving recognition method achieves the best performance when . The result is reasonable because human driving behavior is inherently stochastic.

In the following experiment, we choose . The driving style recognition method is then run with different duration of trajectories. A comparison between the two methods proposed in this paper and benchmark methods is shown in Fig. 7. As seen in the figure, method 2 reduces the prediction error by up to 37.7% (2.28m to 1.42m) compared to using parameter values , and up to 24.4% (1.88m to 1.42m) compared to . Moreover, method 2 outperforms method 1 under both short-trajectory and long-trajectory conditions. Even with only 0.1 seconds of trajectory data available, the prediction error is smaller than with other methods.

4 Conslusion
This paper proposes a new hybrid of offline and online methods, allowing AVs to perform trajectory predictions for surrounding vehicles with limited observation data. We test this approach on a real driving dataset. We first classify vehicles into three driving styles and estimate the IDM parameter for each style. After that, we used local Maximum-Likelihood techniques to perform driving style recognition with real-time observation data. We benchmarked our method against other methods in terms of the RMSE value of the predicted trajectory and the observed trajectory over a 5s duration. The results show that the proposed approach can reduce the prediction error by up to 37.7% compared to using the parameters from other literature and up to 24.4% compared to not performing driving style recognition.
While we perform online driving style recognition, we keep accumulating observation data for a single vehicle in this approach. However, a vehicle’s driving style may change over time. In future work, we will investigate how to choose a suitable ”window” of the trajectory data for driving style recognition. Moreover, While this study only uses observation data from vehicles, future work will also extend the methods with the help of sensor data from roadside units. More datasets will be used to assess the performance of the proposed approach in future work. Finally, the framework of the proposed approach shown in Fig. 1 is very flexible so that one can incorporate newer car-following models and other machine-learning algorithms to make online trajectory predictions.
References
- Abdi and Williams (2010) Abdi, H. and Williams, L.J. (2010). Principal component analysis. Wiley interdisciplinary reviews: computational statistics, 2(4), 433–459.
- Bhattacharyya et al. (2022) Bhattacharyya, R., Jung, S., Kruse, L.A., Senanayake, R., and Kochenderfer, M.J. (2022). A hybrid rule-based and data-driven approach to driver modeling through particle filtering. IEEE Transactions on Intelligent Transportation Systems, 23(8), 13055–13068.
- Bhattacharyya et al. (2020) Bhattacharyya, R.P., Senanayake, R., Brown, K., and Kochenderfer, M.J. (2020). Online parameter estimation for human driver behavior prediction. In 2020 American Control Conference (ACC), 301–306.
- Constantinescu et al. (2010) Constantinescu, Z., Marinoiu, C., and Vladoiu, M. (2010). Driving style analysis using data mining techniques. International Journal of Computers Communications & Control, 5(5), 654–663.
- Cui et al. (2019) Cui, H., Radosavljevic, V., Chou, F.C., Lin, T.H., Nguyen, T., Huang, T.K., Schneider, J., and Djuric, N. (2019). Multimodal trajectory predictions for autonomous driving using deep convolutional networks. In 2019 International Conference on Robotics and Automation (ICRA), 2090–2096.
- Ericsson (2000) Ericsson, E. (2000). Variability in urban driving patterns. Transportation Research Part D: Transport and Environment, 5(5), 337–354.
- Gao et al. (2020) Gao, B., Cai, K., Qu, T., Hu, Y., and Chen, H. (2020). Personalized adaptive cruise control based on online driving style recognition technology and model predictive control. IEEE transactions on vehicular technology, 69(11), 12482–12496.
- Gipps (1981) Gipps, P.G. (1981). A behavioural car-following model for computer simulation. Transportation Research Part B: Methodological, 15(2), 105–111.
- Guo et al. (2019) Guo, Q., Zhao, Z., Shen, P., Zhan, X., and Li, J. (2019). Adaptive optimal control based on driving style recognition for plug-in hybrid electric vehicle. Energy, 186, 115824.
- Hou et al. (2020) Hou, L., Xin, L., Li, S.E., Cheng, B., and Wang, W. (2020). Interactive trajectory prediction of surrounding road users for autonomous driving using structural-lstm network. IEEE Transactions on Intelligent Transportation Systems, 21(11), 4615–4625.
- Krishna and Narasimha Murty (1999) Krishna, K. and Narasimha Murty, M. (1999). Genetic k-means algorithm. IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), 29(3), 433–439.
- Leon and Gavrilescu (2021) Leon, F. and Gavrilescu, M. (2021). A review of tracking and trajectory prediction methods for autonomous driving. Mathematics, 9(6), 660.
- Martinez et al. (2017) Martinez, C.M., Heucke, M., Wang, F.Y., Gao, B., and Cao, D. (2017). Driving style recognition for intelligent vehicle control and advanced driver assistance: A survey. IEEE Transactions on Intelligent Transportation Systems, 19(3), 666–676.
- Montanino and Punzo (2015) Montanino, M. and Punzo, V. (2015). Trajectory data reconstruction and simulation-based validation against macroscopic traffic patterns. Transportation Research Part B: Methodological, 80, 82–106.
- Monteil et al. (2015) Monteil, J., Hara, N.O., Cahill, V., and Bouroche, M. (2015). Real-time estimation of drivers’ behaviour. In 2015 IEEE 18th International Conference on Intelligent Transportation Systems, 2046–2052.
- Murphey et al. (2009) Murphey, Y.L., Milton, R., and Kiliaris, L. (2009). Driver’s style classification using jerk analysis. In 2009 IEEE workshop on computational intelligence in vehicles and vehicular systems, 23–28. IEEE.
- Newell (2002) Newell, G.F. (2002). A simplified car-following theory: a lower order model. Transportation Research Part B: Methodological, 36(3), 195–205.
- Punzo et al. (2011) Punzo, V., Borzacchiello, M.T., and Ciuffo, B. (2011). On the assessment of vehicle trajectory data accuracy and application to the next generation simulation (ngsim) program data. Transportation Research Part C: Emerging Technologies, 19(6), 1243–1262.
- Punzo et al. (2015) Punzo, V., Montanino, M., and Ciuffo, B. (2015). Do we really need to calibrate all the parameters? variance-based sensitivity analysis to simplify microscopic traffic flow models. IEEE Transactions on Intelligent Transportation Systems, 16(1), 184–193.
- Punzo et al. (2021) Punzo, V., Zheng, Z., and Montanino, M. (2021). About calibration of car-following dynamics of automated and human-driven vehicles: Methodology, guidelines and codes. Transportation Research Part C: Emerging Technologies, 128, 103165.
- Serge Hoogendoorn (2010) Serge Hoogendoorn, R.H. (2010). Calibration of microscopic traffic-flow models using multiple data sources. Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, 368, 4497–4517.
- Sheng et al. (2022) Sheng, Z., Xu, Y., Xue, S., and Li, D. (2022). Graph-based spatial-temporal convolutional network for vehicle trajectory prediction in autonomous driving. IEEE Transactions on Intelligent Transportation Systems, 23(10), 17654–17665.
- Taubman-Ben-Ari et al. (2004) Taubman-Ben-Ari, O., Mikulincer, M., and Gillath, O. (2004). The multidimensional driving style inventory—scale construct and validation. Accident Analysis & Prevention, 36(3), 323–332.
- Treiber et al. (2000) Treiber, M., Hennecke, A., and Helbing, D. (2000). Congested traffic states in empirical observations and microscopic simulations. Physical Review E, 62(2), 1805–1824.
- Treiber and Kesting (2013) Treiber, M. and Kesting, A. (2013). Microscopic calibration and validation of car-following models – a systematic approach. Procedia - Social and Behavioral Sciences, 80, 922–939. 20th International Symposium on Transportation and Traffic Theory (ISTTT 2013).
- Vaitkus et al. (2014) Vaitkus, V., Lengvenis, P., and Žylius, G. (2014). Driving style classification using long-term accelerometer information. In 2014 19th International Conference on Methods and Models in Automation and Robotics (MMAR), 641–644. IEEE.
- Wen et al. (2022) Wen, S., Shahd Omar, X.G., Jin, X., and He, Z. (2022). Analysis of vehicle driving styles at freeway merging areas using trajectory data. In 2022 IEEE 25th International Conference on Intelligent Transportation Systems (ITSC), 3652–3656.
- Xu et al. (2015) Xu, L., Hu, J., Jiang, H., and Meng, W. (2015). Establishing style-oriented driver models by imitating human driving behaviors. IEEE Transactions on Intelligent Transportation Systems, 16(5), 2522–2530.
- Xue et al. (2019) Xue, Q., Wang, K., Lu, J.J., and Liu, Y. (2019). Rapid driving style recognition in car-following using machine learning and vehicle trajectory data. Journal of advanced transportation, 2019.