
Big Data and machine learning in medicine: the main problems

6 pages. Published: June 4, 2018


Big data and deep learning technologies play an important role in modern science, and the tendency to work with huge data sets is now spreading to medicine. In this article, based on the experience of the Department of Medical Cybernetics and Informatics of the RNRMU Medical and Biological Faculty, we explain the main issues that researchers face in collecting and processing medical data. These problems may relate to data sources, semantic interoperability, data relevance, multidimensionality, completeness, and comparability. Modern digital health records and related services such as EHR currently cannot provide the necessary “Big Data” information. The healthcare system makes it impossible to collect relevant big data sets in a short period. Further issues are a certain irresponsibility on the part of doctors and patients, their truthfulness about what actually happened, and the difference between those facts and what is written in the medical record. This often leads to incorrect and incomplete data sets in medical information systems. We conclude that “Big Data” in medicine today cannot be “Big” in the way it is in other scientific areas. Researchers should try to collect relevant, truthful, and complete information in a manageable amount and within a reasonable time, and carry out their studies with it.

Keyphrases: Big Data, data analysis, deep learning, experiment design problems, medical data, Medical information systems, medicine

In: Oleg S. Pianykh, Alexey Neznanov, Sergei O. Kuznetsov, Jaume Baixeries and Svetla Boytcheva (editors). WDAM-2017. Workshop on Data Analysis in Medicine, vol 6, pages 61-66

BibTeX entry
  author    = {Svetlana Shchelykalina and Kirill Kiselev and Tatiana Zarubina},
  title     = {Big Data and machine learning in medicine: the main problems},
  booktitle = {WDAM-2017. Workshop on Data Analysis in Medicine},
  editor    = {Oleg S. Pianykh and Alexey Neznanov and Sergei Kuznetsov and Jaume Baixeries and Svetla Boytcheva},
  series    = {Kalpa Publications in Computing},
  volume    = {6},
  pages     = {61--66},
  year      = {2018},
  publisher = {EasyChair},
  bibsource = {EasyChair,},
  issn      = {2515-1762},
  url       = {},
  doi       = {10.29007/vvr8}}