•  
  •  
 

Abstract

The crime rate in India is considerably increasing day by day. Consequently, the data associated with crime is also increasing, opening doors for data-driven approaches to these data to extract insightful knowledge, which can help police and other law enforcement organizations of the country in crime control and prevention. Crime prediction using machine learning algorithms on crime data can predict region-wise crime counts. In this paper, a machine learning-based soft computing regression analysis approach for Indian Crime Data Analysis (ICDA) is proposed. Different regression algorithms, namely, Simple Linear Regression (SLR), Multiple Linear Regression (MLR), Decision Tree Regression (DTR), Support Vector Regression (SVR), and Random Forest Regression (RFR) are uses to build regression models. These regression models can predict a total number of Indian Penal Code (IPC) crime counts and crime counts of different types of crime (murder, rape, kidnapping and abduction, riots, to name a few) region-wise and state-wise and all over the country for a given year. Adjusted R squared value and Mean Absolute Percentage Error (MAPE) is used to evaluate and compare proposed regression models. In the proposed approach for ICDA, district-wise spatial-temporal crime data of years 2001 to 2012 is used, collected from the official website of NCRB. For the chosen data, it is concluded that the region-wise total IPC crime prediction RFR model fits the best with an adjusted R squared value of 0.9631551 and an error of 0.2027437. Whereas for region-wise thefts crime count prediction, the RFR model fits the best with an adjusted R squared value of 0.966604 and an error of 0.16571.

Creative Commons License

Creative Commons License
This work is licensed under a Creative Commons Attribution-Noncommercial-No Derivative Works 4.0 License.

Share

COinS