Data Science Lab

The Data Science Lab focuses on applying scientific methods and algorithms to extract knowledge and insights from structured and unstructured data and to apply to a broad range of application domains. Few of the thrust areas include natural language processing-based application development, AI-based expert system development, and applied machine learning models in marketing, healthcare, finance, and banking, etc.

Lab Activities

Introduce industry relevant courses in the curriculum

Make students industry-ready through internships, projects on real world problems, expert talks and summer courses

Organize seminars, faculty development programs and lecture series for faculty members

Promote multi-disciplinary innovative research and application development

Lab Resources

Experienced faculty members with domain expertise

Well equipped research laboratory with systems with advanced processing capacity, licensed software and tools

Computer server with attached NAS server to provide a world-class IT infrastructure

Uninterrupted 24x7 power supply and high speed internet facility

30-seater training room for running hands on courses and workshops

12-seater office-cum-conference room for discussions

Access to high quality journals (IEEE, Elsevier, Springer) and other digital repositories

Collaborate with us.

GET IN TOUCH

Courses & Trainings

Summer course on Data Science
01 Mar, 2021 - 30 Jun, 2021

This course is specifically designed for students with the objective to simulate industry experience for enhancing their skills by associating with industry experts. It will help them develop sound knowledge on the area of data science and execute industry relevant technical projects under the guidance of experts from industry.
After completion of this course, the students will be able to:

  • understand the patterns in varied forms of data and their behavior
  •  understand how machine learning models can be created (Regression, Classification, Clustering)
  • understand how self-learning machines can ingest continuous pipelines of user data and find patterns, thereby predicting the future with a significant confidence
  • understand predictive modelling, sentiment analysis, user clustering and deep learning with their applications
  • design different models on varied forms of data to present solutions to many data science related problems.

Industry Expert: Parijat Roy, Data Scientist, Microsoft; 6+ years of industry experience; 6+ years of experience in data science

Summer course on Data Science
01 Jan, 2020 - 01 Jun, 2020

The summer course on Data Science is designed specifically for the students for providing them training in the domain. Highlight of this program is the industry relevant technical project which the students are doing under the guidance of an expert from industry. This course will help the students develop sound knowledge on the area of data science.
After completion of the course, the students will be able to:

  • understand various machine learning models
  • design projects on data science to validate the understandings

Industry Experts: Parijat Roy, Data Scientist, Microsoft; 6+ years of industry experience; 6+ years of experience in data science

Lecture Series on Data Science
20 Nov, 2020 - 31 Jul, 2021

The objective of this lecture series is to enhance the skill of faculty members in the domain of Data Science. A series of sessions will be conducted with a focus on different machine learning models and their applications on varied forms of data.
After completion of this lecture series, the faculty members will be able to:

  • understand various machine learning models based on supervised or unsupervised learning mechanism
  • apply the machine learning models on different computational problems in the broad domain of data science
  • execute independent research in different application domains

Academic Expert: Dr. Sudarsan Padhy, Professor, SIT, Bhubaneswar; 46+ years of teaching and research experience.

Workshop on Machine Learning for Data Science using Python (MLDSP-2020)
15 Jan, 2020 - 19 Jan, 2020

The objective of this workshop is to introduce participants to state-of-the-art methods in Machine Learning. Ten resource persons from Academia and Industry delivered talks and conducted hands-on sessions using python during the workshop.
After completion of this workshop, the participants will be able to

  • understand different machine learning models: Linear Regression, Logistic Regression, Gradient Descent Method, and clustering techniques
  • understand the applications of machine learning models in Forecasting, Recommended Systems, Text and Speech Analytics
  • apply different machine learning models on different types of data set and computational problems.

Industry Experts: Dr. Sriparna Saha (IIT Patna), Dr. Ajaya Anand (KIIT, University, Bhubanewar), Dr. Priyadarsan Patra (Xavier University, Bhubaneswar), Dr. Sudarshan Padhy (SIT, Bhubaneswar), Dr. Rakesh Chandra Balabantaray (IIIT, Bhubaneswar), Dr. Rudra Mohan Tripathy (Xavier University, Bhubaneswar), Dr. Debi Das (IMMIT, Bhubaneswar), Dr. Asish Ghosh (ISI, Kolkata), Dr. Ajit Kumar Nayak (SOA university, Bhubaneswar), Dr. Pravat Kumar Santi (TCS Bhubaneswar)

Training Programs on Data Science
01 Mar, 2019 - 01 Sep, 2019

The objective of this training program is to enhance the skill of our students in the broad domain of Data Science by associating them with industry experts.
After completion of this training, the students will be able to:

  • understand the different data pre processing methods
  • understand how machine learning models can be applied on varied forms of data
  • apply machine learning algorithms on different data science application domains
  • Design projects on data science
  • Industry Expert: Parijat Roy, Data Scientist, Microsoft; 6+ years of industry experience; 6+ years of experience in data science
FDP on ‘Data Science’
13 Jul, 2019 - 14 Jul, 2019

The objective of the Faculty Development Program on Data Science is to provide a brief overview of the Data science broad domain to the faculty members.
After completion of this program, the participants will be able to:

  • understand the broad domain of data science
  • understand the opportunities and challenges in applied data science with different Industry case studies
  • identify active areas of research and possible road-map for developing data science knowledge and research

Industry Expert: Dr. Sumit Misra, Associate Vice President, RS Software; an Architect, Researcher, Technology Evangelist, Ph.D. Data Science, 30+ years of industry experience

Lecture series on ‘Machine Learning’
01 Aug, 2019 - 01 Dec, 2019

The objective of this lecture series is to enhance the skill of faculty members in the domain of Machine Learning to prepare them to teach the course. 10 lectures were delivered on selected topics of Machine Learning.
After completion of this lecture series, the faculty members will be able to:

  • understand the basic concepts of machine learning
  • understand different machine learning models and their applications
  • apply different machine learning models on different application domains of data science

Academia Expert: Dr. Sudarsan Padhy, Professor, SIT, Bhubaneswar; 46+ years of teaching and research experience

Seminar on ‘Reproducible Research’
23 Feb, 2018

The objective of conducting a seminar for the faculty members on the topic- “Reproducible Research” is to explain the different approaches and software tools used towards making research reproducible.
After attending the seminar, the faculty members will be able to:

  • understand different approaches and tools for reproducible research.
  • apply the available tools for making research reproducible.

Academia Expert: Dr. Smruti Padhy, Research Scientist, MIT, USA

National Workshop on Data Analytics Using R (DAUR–2018)
06 Jul, 2018 - 12 Jul, 2018

The objective of the National Workshop on Data Analytics Using R is to provide training to the faculty members teaching/ planning to teach Data Analytics/ Machine Learning courses. Six resource persons from Academia and Industry delivered talks and conducted hands-on sessions during the workshop on various Data Analytics techniques and models.
After completion of the workshop, the participants will:

  • have grounding on various Data Analytics techniques along with their theoretical foundation, algorithms, and practical examples to teach/ to do research and development in Data Analytics.
  • learn to use R in Data Analytics problems
  • understand broad opportunities for automation with Data Analytics and apply it in practice
  • formulate problems as Data Analytics tasks and solve them

Experts: Dr. Debasish Samanta (IIT, Kharagpur), Dr. Rudra Mohan Tripathy (Xavier University Bhubaneswar), Prof. Sudarsan Padhy (SIT, Bhubaneswar), Prof. Saroj Kanta Misra (SIT, Bhubaneswar), Prof. Bijan Bihari Misra (SIT, Bhubaneswar), Pradipta Kumar Pattanayak (SIT, Bhubaneswar)

Lecture Series on Deep Learning and its Applications
01 Nov, 2017 - 01 Jun, 2017

The objective of the lecture series on Deep Learning and its Applications is to enhance the skill of faculty members in the domain to carry out advanced research in different domains applying deep learning models. 8 lectures were delivered during this session.
After completion of the lecture series, the faculty members will:

  • have a clear understanding on the deep learning architecture
  • understand implementation of several algorithms using Python
  • be able to apply deep learning on different application domains

Academia Expert: Dr. Sudarsan Padhy, Professor, SIT, Bhubaneswar, 46+ years of teaching and research experience

Projects, People & Publications

Computer Science and Engineering

Computer Science and Engineering

Computer Applications

Computer Science and Engineering

Computer Applications

Computer Science and Engineering

Computer Science and Engineering

Computer Science and Engineering

Computer Science and Engineering

Computer Science and Engineering

Sasmita Mishra, SudarsanPadhy, and Satya Narayan Mishra,Satya Narayan Misra

, "A novel LASSO–TLBO–SVR hybrid model for an efficient portfolio construction", The North American Journal of Economics and Finance, Elsevier, Vol. 55, Article ID: 101350, 0123,

DOI: 10.1016/j.najef.2020.101350

.

Sarat Chandra Nayak and Bijan Bihari Misra

, "Extreme learning with chemical reaction optimization for stock volatility prediction", Financial Innovation, Springer, Vol. 6, Article ID: 16, 0223,

DOI: 10.1186/s40854-020-00177-2

.

Soumya Priyadarsini Panda, V. Pandit, S. Chaturvedi, Rohit Kumar, and Akash Das

, "Natural Language Query Based Question Answering System", International Journal of Engineering and Advanced Technology, Vol. 9, No. 3, pp. 2977-2981, 0223,

DOI: 10.35940/ijeat.B3522.029320

.

Pamela Chaudhury and Hrudaya Kumar Tripathy

, "A novel academic performance estimation model using two stage feature selection", Indonesian Journal of Electrical Engineering and Computer Science, Vol. 19, No. 3, pp. 1610-1619, 0523,

DOI: 10.11591/ijeecs.v19.i3.pp1610-1619

.

Soumya Priyadarsini Panda, Ajit Kumar Nayak, and Satyananda Champati Rai

, "A Survey on Speech Synthesis Techniques in Indian Languages", Multimedia Systems, Springer, Vol. 26, No. 4, pp. 453-478, 0523,

DOI: 10.1007/s00530-020-00659-4

.

Chapala Maharana, Ch. Sanjeev Kumar Dash, Bijan Bihari Mishra, and Satchidananda Dehuri

, "Feature Selection for Classification Using Extreme Learning Machine.", Journal of Emerging Technologies and Innovative Research (JETIR), Vol. 7, No. 6, pp. 769-774, 0623,

DOI: 10.6084/m9.figshare.JETIR2006103

.

Sabyasachi Mohanty, Sudarsan Padhy, and M. Vamsi Krishna

, "A novel
EN-TLBO-SVR model for analysing achievements of government schemes", Electronic Government, Inderscience, Vol. 16, No. 3, pp. 281-303, 0723,

DOI: 10.1504/EG.2020.108494

.

Debasmita Pradhan, Biswajit Sahoo, Bijan Bihari Misra, and SudarsanPadhy

, "A multiclass SVM classifier with teaching learning based feature subset selection for enzyme subclass classification ", Applied Soft Computing, Elsevier, Vol. 96, Article ID: 106664, 1123,

DOI: 10.1016/j.asoc.2020.106664

.

Ch Sanjeev Kumar Dash, Ajit Kumar Behera, Satchidananda Dehuri, and Sung-Bae Cho

, "Building a Novel Classifier Based on Teaching Learning Based Optimization and Radial Basis Function Neural Networks for Non-Imputed Database with Irrelevant Features ", Applied Computing and Informatics, Elsevier, 0323,

DOI: 10.1016/j.aci.2019.03.001

.

Pamela Chaudhury and Hrudaya Kumar Tripathy

, "Optimising the parameters of a RBFN network for a teaching learning paradigm", Indonesian Journal of Electrical Engineering and Computer Science, Vol. 15, No. 1, pp. 435-442, 0423,

DOI: 10.11591/ijeecs.v15.i1.pp435-442

.

Ch Sanjeev Kumar Dash, Ajit Kumar Behera, Sarat Chandra Nayak, Satchidananda Dehuri, and Sung-Bae Cho

, "An Integrated CRO and FLANN Based Classifier for a Non-Imputed and Inconsistent Dataset", International Journal on Artificial Intelligence Tools, Vol. 28, No. 3, Article ID: 1950013, 0523,

DOI: 10.1142/S0218213019500131

.

SasmitaMishra and SudarsanPadhy

, "An efficient portfolio construction model using stock price predicted by support vector regression ", The North American Journal of Economics and Finance, Elsevier, Vol. 50, Article ID: 101027, 0723,

DOI: 10.1016/j.najef.2019.101027

.

Soumya Priyadarsini Panda, Varun Behera, Alloran Pradhan, and Abhisekh Mohanty

, "A Rule-based Information Extraction System", International Journal of Innovative Technology and Exploring Engineering, Vol. 8, No. 9, pp. 1613-1617, 0723,

DOI:10.35940/ijitee.I8156.078919

.

Sarat Chandra Nayak and Bijan Bihari Misra

, "A chemical-reaction-optimization-based neuro-fuzzy hybrid network for stock closing price prediction ", Financial Innovation, Springer, Vol. 5, No.1, Article ID: 38 , 1123,

DOI: 10.1186/s40854-019-0153-1

.

Rasmita Dash and Bijan Bihari Misra

, "Performance analysis of clustering techniques over microarray data: A case study", Physica A: Statistical Mechanics and its Applications, Elsevier, Vol. 493, pp. 162-176, 0323,

DOI: 10.1016/j.physa.2017.10.032

.

Sarat Chandra Nayak, Bijan Bihari Misra, and Himansu Sekhar Behera

, "ACFLN: artificial chemical functional link network for prediction of stock market index", Evolving Systems, Springer, pp. 1-26, 0323,

DOI: 10.1007/s12530-018-9221-4

.

Pamela Chaudhury and Hrudaya Kumar Tripathy

, "A Study on impact of smartphone addiction on academic performance", International Journal of Engineering & Technology, Vol. 7, No. 2.6, pp. 50-53, 0323,

DOI: 10.14419/ijet.v7i2.6.10066

.

Rasmita Dash and Bijan Bihari Misra

, "A multi-objective feature selection and classifier ensemble technique for microarray data analysis ", International Journal of Data Mining and Bioinformatics, Inderscience, Vol. 20, No. 2, pp. 123-160, 0723,

DOI: 10.1504/IJDMB.2018.10015103

.

S. C. Nayak and Bijan Bihari Misra

, "Estimating Stock Closing Indices Using a GA-Weighted Condensed Polynomial Neural Network ", Financial Innovation, Springer, Vol. 4, No. 21, pp. 1-22, 0923,

DOI: 10.1186/s40854-018-0104-2)

.

Soumya Priyadarsini Panda, Ajit Kumar Nayak

, "A Context-based Numeral Reading Technique for Text to Speech Systems", International Journal of Electrical and Computer Engineering, Vol. 8, No. 6, pp. 4533-4544, 1023,

DOI: 10.11591/ijece.v8i6.pp4533-4544

.

Sarat C. Nayak, Bijan Bihari Misra, and Himanshu S. Behera

, "Exploration and Incorporation of Virtual Data Position for Efficient Forecasting of Financial Time Series ", International Journal of Industrial and Systems Engineering, Vol. 26, No. 1, pp. 42-62, 0323,

DOI: 10.1504/IJISE.2017.10003891

.

Debasmita Pradhan, Sudarsan Padhy, and Biswajit Sahoo

, "Enzyme Classification using Multiclass Support Vector Machine and Feature Subset Selection ", Computational Biology and Chemistry, Elsevier, Vol. 70, pp. 211-219, 0823,

DOI: 10.1016/j.compbiolchem.2017.08.009

.

Chapala Maharana, Ch. Sanjeev Kumar Dash, and Bijan Bihari Misra

, "Extreme Learning Machine Classifier: a topical state of the art-survey", International Journal of Engiineering Science Invention, pp. 48-59, 1023.

Soumya Priyadarsini Panda and Ajit Kumar Nayak

, "A Waveform Concatenation Technique for Text-to-Speech Synthesis", International Journal of speech Technology, Springer, Vol. 20, No. 4, pp. 959-976, 1023,

DOI: 10.1007/s10772-017-9463-8

.

Pamela Chaudhury and Hrudaya Kumar Tripathy

, "An empirical study on attribute selection of student performance prediction model", International Journal of Learning Technology, Inderscience, Vol. 12, No. 3, pp. 241-252, 1223,

DOI: 10.1504/IJLT.2017.088407

.

Rasmita Dash
and Bijan Bihari Misra

, "Pipelining the Ranking Techniques for Microarray Data Classification: A case study", Applied Soft Computing, Elsevier, Vol. 48, pp. 298-316, 1123,

DOI: 10.1016/j.asoc.2016.07.006

.

Alka Ranjan and Soumya Priyadarsini Panda

, "A Semantic based Information Retrieval System", in Proceedings of International Conference on Communication, Circuits, and Systems (iC3S 2020), Springer, KIIT, Bhubaneswar, pp. 517-523, 0423,

DOI: 10.1007/978-981-33-4866-0_63

.

Soumya Priyadarsini Panda, and Jasaswi Prasad Mohanty

, "A Domain Classification-based Information Retrieval System", in Proceedings of 6th IEEE international Women in Engineering (WIE) Conference on Electrical and Computer Engineering 2020 (WIECON-ECE 2020) , IEEE, IEEE Bhubaneswar Subsection, pp. 122-125, 0423,

DOI: 10.1109/WIECON-ECE52138.2020.9398018.

.

Sarat Chandra Nayak, Ch. Sanjeev Kumar Dash, Bijan Bihari Mishra, and Satchidananda Dehuri

, " Multi-verse optimization of multilayer perceptrons (MV-MLPs) for efficient modeling and forecasting of crude oil prices data ", in Proceedings of International Conference on Biologically Inspired Techniques in Many-Criteria Decision Making (BITMDM-2019) , Springer, Fakir Mohan University, Balasore, pp. 46-54, 1223,

DOI: 10.1007%2F978-3-030-39033-4_4

.

Ajit Kumar Behera and Mrutyunjaya Panda

, "Software reliability prediction with ensemble method and virtual data point incorporation ", in Proceedings of International Conference on Biologically Inspired Techniques in Many-Criteria Decision Making (BITMDM-2019) , Springer, Fakir Mohan University, Balasore, pp. 69-77, 1223,

DOI: 10.1007/978-3-030-39033-4_7

.

Lambodar Jena, Soumen Nayak, and Ramakrushna Swain

, "Chronic Disease Risk (CDR) Prediction in Biomedical Data Using Machine Learning Approach", in Proceedings of ICAC 2019, Springer, SoA University, Bhubaneswar, pp 232-239, 0123,

DOI: 10.1007%2F978-981-15-2774-6_29

.

Soumya Priyadarsini Panda, Ajit Kumar Nayak

, "Spectral Smoothening based Waveform Concatenation Technique for Speech Quality Enhancement in Text to Speech Systems ", in Proceedings of 3rd International Conference on Advanced Computing and Intelligent Engineering,, Springer, SoA University, Bhubaneswar, pp. 425-432, 0223,

DOI: 10.1007%2F978-981-15-1081-6_36

.

Soumya Priyadarsini Panda

, "Intelligent Voice-based Authentication System", in Proceedings of 3rd International conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud, IEEE, SCAD Institute of Technology, Tamilnadu, pp. 757-760, 0323,

DOI:
10.1109/I-SMAC47947.2019.9032671

.

Debolina Mahapatra, Chandan Maharana, Soumya Priyadarsini Panda, Jasaswi Prasad Mohanty, Abu Talib, and Amit Mangaraj

, "A Fuzzy-Cluster based Semantic Information Retrieval System", in Proceedings of International Conference on Computing Methodologies and Communication (ICCMC 2020), IEEE, SEC, Tamilnadu, pp. 675-678, 0423,

DOI: 10.1109/ICCMC48092.2020.ICCMC-000125

.

Sarat C. Nayak,Ch Sanjeev Kumar Dash, Ajit Kumar Behera, and Satchidananda Dehuri

, "Improving Stock Market Prediction Through Linear Combiners of Predictive Models", in Proceedings of International Conference on Computational Intelligence in Data Mining (ICCIDM2018), Springer, VSSUT, Burla, pp. 415-426, 0823,

DOI: 10.1007/978-981-13-8676-3_36

.

Alloran Pradhan, Varun Behera, Abhisekh Mohanty, and Soumya Priyadarsini Panda

, "A Voice-based Information Extraction System", in Proceedings of 3rd International Conference on Smart Computing & Informatics (SCI2018), Springer, KIIT, Bhubaneswar, pp. 593-602, 0923,

DOI: 10.1007/978-981-13-9282-5_56

.

Soumya Priyadarsini Panda

, "Automated Speech Recognition System in Advancement of Human-Computer Interaction", in Proceedings of International Conference on Computing Methodologies and Communication (ICCMC 2017), IEEE, SEC, Tamil Nadu, pp. 302-305, 0223,

DOI: 10.1109/ICCMC.2017.8282696

.

Soumya Priyadarsini Panda and Ajit Kumar Nayak

, "Vowel Onset Point based Waveform Concatenation Technique for Intelligible Speech Synthesis ", in Proceedings of International Conference on Computing Methodologies and Communication (ICCMC 2017), IEEE, SEC, Tamil Nadu, pp. 622-626, 0223,

DOI: 10.1109/ICCMC.2017.8282542

.

Ajit Kumar Behera, Sarat Chandra Nayak, Ch. Sanjeev Kumar Dash, Satchidananda Dehuri, Mrutyunjaya Panda

, "Improving S/W Reliability Prediction Accuracy using CRO based FLANN", in Proceedings of 5th International Conference on Innovations in Computer Science & Engineering (ICICSE-2017) , Springer, Guru Nanak Institute, India, pp 213-220, 0523,

DOI: 10.1007/978-981-10-8201-6_24

.

Lambodar Jena and Ramakrushna Swain

, "Chronic Disease Risk Prediction using Distributed Machine Learning Classifiers", in Proceedings of 16th International Conference on Information Technology (ICIT 2017), IEEE, SIT, Bhubaneswar, pp. 170-173, 0623,

DOI 10.1109/ICIT.2017.46

.

Soumya Priyadarsini Panda and Ajit Kumar Nayak

, "A Rule-based Text Tokenization Technique for Text to Speech Conversion in Indian Languages ", in Proceedings of 2nd International Conference on Inventive Computation Technologies ( ICICT 2017) , IEEE, RVS Technical Campus, Coimbatore, pp. 976-980, 0623.

Pamela Chaudhury, Sushruta Misra, Hrudaya Kumar Tripathy, and Brojo Kumar Misra

, "Enhancing the Capabilities of Student Result Prediction System", in Proceedings of 2nd International Conference on Information and Communication Technology for Competitive Strategies (ICTCS-2016) , ACM, Udaipur, Rajasthan, India, Article ID: 91, 0923,

DOI: 10.1145/2905055.2905150

.

Sushruta Misra, Pamela Chaudhury, Hrudaya Kumar Tripathy, and Brojo Kumar Misra

, "An Implementation of Feature Ranking using Machine Learning Techniques for Diabetes Disease Prediction ", in Proceedings of 2nd International Conference on Information and Communication Technology for Competitive Strategies (ICTCS-2016) , ACM, Udaipur, Rajasthan, India, Article ID: 42, 0923,

DOI: 10.1145/2905055.2905100

.

Ajit Kumar Behera and Mrutyunjaya Panda

, "Efficient Software Reliability Prediction With Evolutionary Virtual Data Position Exploration ", in Title: Handbook of Research on Automated Feature Engineering and Advanced Applications in Data Science , IGI Global, Chapter. 16, pp. 275-285, 0123,

DOI: 10.4018/978-1-7998-6659-6.ch016

.

Sarat Chandra Nayak, Bijan Bihari Misra, and Himansu Sekhar Behera

, "On Developing and Performance Evaluation of Adaptive Second Order Neural Network With GA-Based Training (ASONN-GA) for Financial Time Series Prediction   ", in Title: Advancements in Applied Metaheuristic Computing, IGI Global, Chapter. 10, pp. 231-263, 0323,

DOI: 10.4018/978-1-5225-4151-6.ch010

.

Ch. Sanjeev K. Dash, Ajit K. Behera, and Sarat C. Nayak

, "DE-Based RBFNs for Classification With Special Attention to Noise Removal and Irrelevant Features ", in Title: Handbook of Research on Modeling, Analysis, and Application of Nature-Inspired Metaheuristic Algorithms , IGI Global, Chapter. 11, pp. 218-243, 0823,

DOI: 10.4018/978-1-5225-2857-9.ch011

.

Keep up to date with research outputs from the Silicon research community.

VIEW RESEARCH PUBLICATIONS