Applicability of Data Mining and Predictive Analysis for Tobacco Cessation – An Exploratory Study

  • Kavita Rijhwani Post Graduate Student, Department of Public Health Dentistry, Maulana Azad Institute of Dental Sciences, Delhi, India
  • Vikrant R R Mohanty Associate Professor and Head, Department of Public Health Dentistry, Maulana Azad Institute of Dental Sciences, Delhi, India
  • Aswini YB Assistant Professor, Department of Public Health Dentistry, Maulana Azad Institute of Dental Sciences, Delhi, India
  • Vaibhav Singh Assistant Professor, Department of Computer Science, Rameshwaram Institute of Technology and Management, Lucknow(U.P), India
  • Sumbul Hashmi Post Graduate Student, Department of Public Health Dentistry, Maulana Azad Institute of Dental Sciences, Delhi, India
Keywords: Data Mining, Tobacco Cessation, Classification Algorithms, Prediction

Abstract

OBJECTIVES: Predictive analysis can be used effectively to evaluate enormous data generated by health care industry to extract information and establish relationships amongst the variables. Unlike traditional statistical methods, it uses artificial intelligence to reveal associations which health care professionals would never even suspect. Tobacco cessation is clearly beneficial, however many tobacco users respond differently as it is based on multitude of factors.  So our objectives to understand the data mining techniques using WEKA tool and its role in predictive analysis as well as to predict the quit status of patients using prediction algorithms in tobacco cessation. 

METHODS: WEKA a Data Mining Tool used to classify the data and evaluated using 10-fold cross-validations. The various algorithms used in this tool are Naïve Bayes, SMO, Random Forest, J-48 and Decision stump to further analyse its role in determining the quit status of patients. For this secondary data of 655 patients from Tobacco Cessation Clinic was utilized and described using 20 different attributes for prediction of quit status.

RESULTS: The result showed that Decision stump and SMO was found to be having a best prediction and accuracy for prediction of quitting status. Out of 20 attributes, previous attempt of quitting, type of intervention, number of years since habit initiated were found to be associated with early quitting rate.

CONCLUSION: This study concludes that Data mining & predictive analytics models like WEKA tool will not only improve patient outcomes but understand variables or combination of variables for effective interventions in tobacco cessation.

Author Biographies

Vikrant R R Mohanty, Associate Professor and Head, Department of Public Health Dentistry, Maulana Azad Institute of Dental Sciences, Delhi, India

ASSOCIATE PROFESSOR & HEAD,

DEPARTMENT OF PUBLIC HEALTH DENTISTRY

MAULANA AZAD INSTITUTE OF DENTAL SCIENCES, NEW DELHI

Aswini YB, Assistant Professor, Department of Public Health Dentistry, Maulana Azad Institute of Dental Sciences, Delhi, India

ASSISTANT PROFESSOR,

DEPARTMENT OF PUBLIC HEALTH DENTISTRY

MAULANA AZAD INSTITUTE OF DENTAL SCIENCES, NEW DELHI

Vaibhav Singh, Assistant Professor, Department of Computer Science, Rameshwaram Institute of Technology and Management, Lucknow(U.P), India

ASSISTANT PROFESSOR, 

DEPARTMENT OF COMPUTER SCIENCES

RAMESHWARAM INSTITUTE OF TECHNOLOGY AND MANAGEMENT, LUCKNOW

Sumbul Hashmi, Post Graduate Student, Department of Public Health Dentistry, Maulana Azad Institute of Dental Sciences, Delhi, India

POST GRADUATE STUDENT,

DEPARTMENT OF PUBLIC HEALTH DENTISTRY

MAULANA AZAD INSTITUTE OF DENTAL SCIENCES, NEW DELHI

References

World Health Organization, Research for International Tobacco Control. WHO report on the global tobacco epidemic, 2008: the MPOWER package. World Health Organization; 2008 Feb 11.

Jha P, Jacob B, Gajalakshmi V, Gupta PC, Dhingra N, Kumar R, Sinha DN, Dikshit RP, Parida DK, Kamadod R, Boreham J. A nationally representative case–control study of smoking and death in India. N Engl J Med. 2008 Mar 13;358(11):1137-47.

Shimkhada R, Peabody JW. Tobacco control in India. Bulletin of the World Health Organization. 2003 Jan;81(1):48-52.

Thankappan KR. Tobacco cessation in India: A priority health intervention. Indian J. Med. Res. 2014 Apr;139(4):484.

Murthy P, Saddichha S. Tobacco cessation services in India: Recent developments and the need for expansion. Indian J. Cancer. 2010 Jul 1;47(5):69.

World Health Organization. WHO report on the global tobacco epidemic, 2013: enforcing bans on tobacco advertising, promotion and sponsorship. World Health Organization; 2013.

Kaur J, Jain DC. Tobacco control policies in India: implementation and challenges. Indian J Public Health. 2011 Jul 1;55(3):220.

Persai D, Panda R, Gupta A. Examining implementation of tobacco control policy at the district level: a case study analysis from a high burden state in India. Adv Prev Med. 2016 Jan 3;2016.

Hand DJ, Mannila H, Smyth P. Principles of data mining. MIT press; 2001. Retrievedfrom:https://books.google.co.in/books?hl=en&lr=&id=SdZbhVhZGYC&oi=fnd&pg=PR17&dq=Principles+of+data+mining.+MIT+press%3B+2001&ots=yxT6viunlY&sig=5YIFsgMxtb2vruYBoSK2gVM3GRU#v=onepage&q&f=false

Koh HC, Tan G. Data mining applications in healthcare. J Healthc Inf Manag. 2011 Jan;19(2):65. doi:http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.92.3184&rep=rep1&type=pdf.

Vijayarani S, Sudha S. Comparative analysis of classification function techniques for heart disease prediction. ‎IJIRCCE. 2013 May;1(3):735-41.

Joshi J, Doshi R, Patel J. Diagnosis and prognosis breast cancer using classification rules. Int. j. eng. res.gen. sci. 2014 Oct;2(6):315-23..Retreivedfrom http://www.ijergs.org/files/documents/DIAGNOSIS-39.pdf .

Vijayarani S, Dhayanand S, Phil M. Kidney disease prediction using SVM and ANN algorithms. IJCBR. 2015;6(2).

http://www.researchmanuscripts.com/March2015/2.pdf

Kumar MN. Alternating decision trees for early diagnosis of dengue fever. arXiv preprint arXiv:1305.7331. 2013 May 31. https://arxiv.org/abs/1305.7331

Caponnetto P, Polosa R. Common predictors of smoking cessation in clinical practice. Respir Med. 2008 Aug 31;102(8):1182-92.

doi:http://dx.doi.org/10.1016/j.rmed.2008.02.017.

Kositbowornchai S, Siriteptawee S, Plermkamon S, Bureerat S, Chetchotsak D. An artificial neural network for detection of simulated dental caries. Int J Comput Assist Radiol Surg. 2006 Aug 1;1(2):91-6. doi:10.1007/s11548-006-0040-x.

Bahaa K, Noor G, Yousif Y. The Artificial Intelligence Approach for Diagnosis, Treatment and Modelling in Orthodontic. In Principles in Contemporary Orthodontics 2011. InTech..

Shankarapillai R, Mathur LK, Nair MA, Rai N, Mathur A. Periodontitis risk assessment using two artificial neural networks-a pilot study. Int. j. dent. clin. 2010 Dec 31;2(4).

Remco R. Bouckaert, Eibe Frank, Mark Hall, Richard Kirkby, Peter Reutemann, Alex Seewald and David Scuse. (2013). “WEKA Manual form Version 3-7-10”, 2013.https://pdfs.semanticscholar.org/d617/d41097bdf97d994d1481adbcfe0c05a51696.pdf

Solanki AV. Data mining techniques using WEKA classification for Sickle Cell Disease. Int. J. Inf. Technol. 2014;5(4):5857-60.

http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.636.3433&rep=rep1&type=pdf

Yasodha P, Kannan M. Analysis of a population of diabetic patients databases in WEKA tool. J. sci. eng. res. 2011 May; 2(5):15.

https://pdfs.semanticscholar.org/35ae/5e033d0c3e87741a4328b9129c2c490af7b9.pdf

Durairaj M, Ranjani V. Data mining applications in healthcare sector: a study. IJSTR. 2013 Oct 25;2(10):29-35.

Dhamodharan S. Liver Disease Prediction Using Bayesian Classification. In4th National Conference on Advanced Computing, Applications & Technologies 2014 May. doi: http://dx.doi.org/10.6084/ijact.v0i0.443

Kopycka‐Kędzierawski DT, Billings RJ. Application of nonhomogenous Markov models for analyzing longitudinal caries risk. Community Dent Oral Epidemiol. 2006 Apr 1;34(2):123-9.

Bratthall D, Hänsel Petersson G. Cariogram–a multifactorial risk assessment model for a multifactorial disease. Community Dent Oral Epidemiol. 2005 Aug 1;33(4):256-64. doi: 10.1111/j.1600-0528.2005.00233.x

Sharma N, Om H. Data mining models for predicting oral cancer survivability. Netw Model Anal Health Inform Bioinform. 2013 Dec 1;2(4):285-95. doi:10.1007/s13721-013-0045-7.

Oberoi SS, Sharma G, Nagpal A, Oberoi A. Tobacco cessation in India: how can oral health professionals contribute. Asian Pac J Cancer Prev. 2014 Jan 1;15:2383-91.

Published
2019-05-29
How to Cite
1.
Rijhwani K, Mohanty VR, YB A, Singh V, Hashmi S. Applicability of Data Mining and Predictive Analysis for Tobacco Cessation – An Exploratory Study. Front Dent. :Uncorrected Proof.
Section
Original Article