{"id":619,"date":"2019-11-09T10:16:36","date_gmt":"2019-11-09T10:16:36","guid":{"rendered":"http:\/\/guires.uk\/newsroom\/?p=619"},"modified":"2019-11-11T05:19:07","modified_gmt":"2019-11-11T05:19:07","slug":"predicting-diabetes-predictive-modeling-using-training-dataset","status":"publish","type":"post","link":"https:\/\/guires.uk\/newsroom\/use-cases\/predicting-diabetes-predictive-modeling-using-training-dataset\/","title":{"rendered":"Predicting Diabetes \u2013 Predictive Modeling using Training dataset."},"content":{"rendered":"\n<h3 class=\"h3color\">The Challenges<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Diabetes<\/strong> is a group of metabolic disorders in which there are high blood sugar levels over a prolonged period. Symptoms of high blood sugar include frequent urination, increased thirst, and increased hunger. If left untreated, diabetes can cause many complications. Acute complications can include diabetic ketoacidosis, hyperosmolar hyperglycemic state, or death. Serious long-term complications include cardiovascular disease, stroke, chronic kidney disease, foot ulcers, and damage to the eyes.<\/p>\n\n\n\n<h3 class=\"h3color\">Opportunity <\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">In order to understand\npredictive modelling, we tried to build a machine\nlearning model to accurately predict whether or not the patients in the dataset\nhave diabetes or not. The datasets consist of several\nmedical predictor variables and one target variable, Outcome. Predictor\nvariables includes the number of pregnancies the patient has had, their BMI,\ninsulin level, age, and so on.<\/p>\n\n\n\n<ul class=\"wp-block-list\"><li><strong>Pregnancies<\/strong>: Number of times pregnant<\/li><li><strong>Glucose<\/strong>: Plasma glucose concentration 2 hours\nin an oral glucose tolerance test<\/li><li><strong>blood pressure<\/strong>: Diastolic blood\npressure (mm Hg)<\/li><li><strong>SkinThickness<\/strong>: Triceps skinfold thickness (mm)<\/li><li><strong>Insulin<\/strong>: 2-Hour serum insulin (mu U\/ml)<\/li><li><strong>BMI<\/strong>: Body mass index (weight in\nkg\/(height in m)^2)<\/li><li><strong>DiabetesPedigreeFunction<\/strong>: Diabetes\npedigree function<\/li><li><strong>Age<\/strong>: Age (years)<\/li><li><strong>Outcome<\/strong>: Class variable (0 or 1)<\/li><\/ul>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>First five rows of the dataset look like this &#8211;<\/strong><\/p>\n\n\n\n<div class=\"wp-block-image\"><figure class=\"aligncenter\"><img loading=\"lazy\" decoding=\"async\" width=\"624\" height=\"188\" src=\"https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-49.png\" alt=\"\" class=\"wp-image-620\" srcset=\"https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-49.png 624w, https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-49-300x90.png 300w\" sizes=\"auto, (max-width: 624px) 100vw, 624px\" \/><\/figure><\/div>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>DataFrame.describe()<\/strong>&nbsp;method generates descriptive statistics that summarize the central tendency, dispersion and shape of a dataset\u2019s distribution, excluding NaN values. This method tells us a lot of things about a dataset.<\/p>\n\n\n\n<div class=\"wp-block-image\"><figure class=\"aligncenter\"><img loading=\"lazy\" decoding=\"async\" width=\"624\" height=\"211\" src=\"https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-50.png\" alt=\"\" class=\"wp-image-621\" srcset=\"https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-50.png 624w, https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-50-300x101.png 300w\" sizes=\"auto, (max-width: 624px) 100vw, 624px\" \/><\/figure><\/div>\n\n\n\n<p class=\"wp-block-paragraph\">A question which arises here is \u2013<\/p>\n\n\n\n<h4 class=\"wp-block-heading\">Can the minimum value of below-listed\ncolumns be zero (0)?<\/h4>\n\n\n\n<p class=\"wp-block-paragraph\">On these columns, a value of zero does\nnot make sense and thus indicates a missing value. Following columns or\nvariables have an invalid zero value:<\/p>\n\n\n\n<ol class=\"wp-block-list\"><li>Glucose<\/li><li>blood pressure<\/li><li>skin thickness<\/li><li>Insulin<\/li><li>BMI<\/li><\/ol>\n\n\n\n<p class=\"wp-block-paragraph\">So we can say that the zero elements might be the null values in the dataset. Let\u2019s find out the Total null values in the dataset with respect to each feature.<\/p>\n\n\n\n<div class=\"wp-block-image\"><figure class=\"aligncenter\"><img loading=\"lazy\" decoding=\"async\" width=\"256\" height=\"158\" src=\"https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-51.png\" alt=\"\" class=\"wp-image-622\"\/><\/figure><\/div>\n\n\n\n<p class=\"wp-block-paragraph\">We need to\nfigure out how to impute these NaN values so that we can use them in modelling.\nThere are many ways to impute NaN values in the dataset, such as&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <\/p>\n\n\n\n<ul class=\"wp-block-list\"><li>Filling in median values of\nthe features<\/li><li>Filling in the mean value\nof the total feature<\/li><li>Removing the feature column\n(if more than 70% of feature is null)<\/li><\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">Here, to find out which way to do, we can use <strong>hist<\/strong> from <strong>matplotlib<\/strong> to know how the data is spread for a particular feature. <\/p>\n\n\n\n<div class=\"wp-block-image\"><figure class=\"aligncenter\"><img loading=\"lazy\" decoding=\"async\" width=\"623\" height=\"337\" src=\"https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-52.png\" alt=\"\" class=\"wp-image-623\" srcset=\"https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-52.png 623w, https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-52-300x162.png 300w\" sizes=\"auto, (max-width: 623px) 100vw, 623px\" \/><\/figure><\/div>\n\n\n\n<p class=\"wp-block-paragraph\">We will be imputing NaN values for the columns in accordance with their distributions.\nGlucose feature will use mean of the column as it is normally distributed,\nInsulin is skewed towards left and does have more values from 0-100, so we will\nuse median on it, same for BMI and Skin thickness features and&nbsp; at last for blood pressure we will use mean\non it.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The results are \u2013<\/p>\n\n\n\n<div class=\"wp-block-image\"><figure class=\"aligncenter\"><img loading=\"lazy\" decoding=\"async\" width=\"660\" height=\"360\" src=\"https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-53.png\" alt=\"\" class=\"wp-image-624\" srcset=\"https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-53.png 660w, https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-53-300x164.png 300w\" sizes=\"auto, (max-width: 660px) 100vw, 660px\" \/><\/figure><\/div>\n\n\n\n<p class=\"wp-block-paragraph\">Let us see our <strong>outcome<\/strong> feature (this is our target variable) \u2013 <\/p>\n\n\n\n<div class=\"wp-block-image\"><figure class=\"aligncenter\"><img loading=\"lazy\" decoding=\"async\" width=\"297\" height=\"211\" src=\"https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-54.png\" alt=\"\" class=\"wp-image-625\"\/><\/figure><\/div>\n\n\n\n<p class=\"wp-block-paragraph\">Now let us see the Correlation between each variable to analyse our data well, we are using a <strong>seaborn<\/strong> package to get this correlation.<\/p>\n\n\n\n<div class=\"wp-block-image\"><figure class=\"aligncenter\"><img loading=\"lazy\" decoding=\"async\" width=\"655\" height=\"564\" src=\"https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-55.png\" alt=\"\" class=\"wp-image-626\" srcset=\"https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-55.png 655w, https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-55-300x258.png 300w\" sizes=\"auto, (max-width: 655px) 100vw, 655px\" \/><\/figure><\/div>\n\n\n\n<p class=\"wp-block-paragraph\">Glucose and Insulin are the important features which might have a huge impact on the target variable (Outcome).<\/p>\n\n\n\n<div class=\"wp-block-image\"><figure class=\"aligncenter\"><img loading=\"lazy\" decoding=\"async\" width=\"630\" height=\"405\" src=\"https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-56.png\" alt=\"\" class=\"wp-image-627\" srcset=\"https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-56.png 630w, https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-56-300x193.png 300w\" sizes=\"auto, (max-width: 630px) 100vw, 630px\" \/><\/figure><\/div>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Standardisation<\/strong>:&nbsp;&nbsp;&nbsp;&nbsp; this is an important step before modelling as the features are needed to be in a particular range. Consider this example, if feature age 50 and insulin 0.5 is fed into ML algorithm the machine thinks that age is more important as it has higher value. For this reason, there is a need to normalize the dataset leaving out the target column. After completing this standardisation process the dataset looks like this- <\/p>\n\n\n\n<div class=\"wp-block-image\"><figure class=\"aligncenter\"><img loading=\"lazy\" decoding=\"async\" width=\"622\" height=\"184\" src=\"https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-57.png\" alt=\"\" class=\"wp-image-628\" srcset=\"https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-57.png 622w, https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-57-300x89.png 300w\" sizes=\"auto, (max-width: 622px) 100vw, 622px\" \/><\/figure><\/div>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Predictive modelling:<\/strong><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">We will be using different machine learning algorithms, ensemble\nmodelling, cross-validation(K-fold) to find out better accuracy for predicting\nif a person has Diabetes or not.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>SVM:&nbsp;&nbsp; <\/strong>we will be using support vector machine from sklearn python package. There are two kernels in svm namely Linear and rbf, and we will be testing both the algorithms.<\/p>\n\n\n\n<div class=\"wp-block-image\"><figure class=\"aligncenter\"><img loading=\"lazy\" decoding=\"async\" width=\"624\" height=\"276\" src=\"https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-58.png\" alt=\"\" class=\"wp-image-629\" srcset=\"https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-58.png 624w, https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-58-300x133.png 300w\" sizes=\"auto, (max-width: 624px) 100vw, 624px\" \/><\/figure><\/div>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Logistic Regression: <\/strong>this algorithm is also from sklearn python package, here the main hyperparameters are C and Penalty(L1 or L2)<\/p>\n\n\n\n<div class=\"wp-block-image\"><figure class=\"aligncenter\"><img loading=\"lazy\" decoding=\"async\" width=\"624\" height=\"157\" src=\"https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-59.png\" alt=\"\" class=\"wp-image-630\" srcset=\"https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-59.png 624w, https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-59-300x75.png 300w\" sizes=\"auto, (max-width: 624px) 100vw, 624px\" \/><\/figure><\/div>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>KNN: <\/strong>K- nearest neighbour is a machine learning algorithm which uses distance metrics to find the closest neighbours of our features, we need to find the value of k to get the best accuracy at a particular value.<\/p>\n\n\n\n<div class=\"wp-block-image\"><figure class=\"aligncenter\"><img loading=\"lazy\" decoding=\"async\" width=\"658\" height=\"438\" src=\"https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-60.png\" alt=\"\" class=\"wp-image-631\" srcset=\"https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-60.png 658w, https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-60-300x200.png 300w, https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-60-600x400.png 600w\" sizes=\"auto, (max-width: 658px) 100vw, 658px\" \/><\/figure><\/div>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Random Forest<\/strong>: this algorithm is a type of ensemble model which works well for classification problems. Let us first find the important features and then predict the outcome. These are the feature importance of our dataset &#8211;  <\/p>\n\n\n\n<div class=\"wp-block-image\"><figure class=\"aligncenter\"><img loading=\"lazy\" decoding=\"async\" width=\"624\" height=\"235\" src=\"https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-61.png\" alt=\"\" class=\"wp-image-632\" srcset=\"https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-61.png 624w, https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-61-300x113.png 300w\" sizes=\"auto, (max-width: 624px) 100vw, 624px\" \/><\/figure><\/div>\n\n\n\n<p class=\"wp-block-paragraph\">Here we can see that Glucose is the most important feature in the dataset if the Glucose level increases or Decreases then there is a huge possibility of getting Diabetes. <\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Accuracy:  <\/p>\n\n\n\n<div class=\"wp-block-image\"><figure class=\"aligncenter\"><img loading=\"lazy\" decoding=\"async\" width=\"683\" height=\"58\" src=\"https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-62.png\" alt=\"\" class=\"wp-image-633\" srcset=\"https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-62.png 683w, https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-62-300x25.png 300w\" sizes=\"auto, (max-width: 683px) 100vw, 683px\" \/><\/figure><\/div>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Cross-validation<\/strong><strong>:<\/strong> Cross-validation\nis a resampling procedure used to evaluate machine learning models on a limited\ndata sample.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The general procedure is as follows:<\/p>\n\n\n\n<ol class=\"wp-block-list\"><li>Shuffle the dataset randomly.<\/li><li>Split the dataset into k groups.<\/li><li>For each unique group:<ol><li>Take the group as a holdout or test data set.<\/li><\/ol><ol><li>Take the remaining groups as a training data set.<\/li><\/ol><ol><li>Fit a model on the training set and evaluate it on the test set.<\/li><\/ol><ol><li>Retain the evaluation score and discard the model.<\/li><\/ol><\/li><li>Summarize the skill of the model using the sample of model evaluation\nscores.<\/li><\/ol>\n\n\n\n<p class=\"wp-block-paragraph\">Now we will be using this cross validation for our algorithms and check which gives us better accuracy \u2013 let K be 10<\/p>\n\n\n\n<div class=\"wp-block-image\"><figure class=\"aligncenter\"><img loading=\"lazy\" decoding=\"async\" width=\"669\" height=\"358\" src=\"https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-63.png\" alt=\"\" class=\"wp-image-634\" srcset=\"https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-63.png 669w, https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-63-300x161.png 300w\" sizes=\"auto, (max-width: 669px) 100vw, 669px\" \/><\/figure><\/div>\n\n\n\n<p class=\"wp-block-paragraph\">We can infer\nthat the best accuracy for this model was from k-fold(Logistic regression) with\n77.5%<\/p>\n\n\n\n<div class=\"wp-block-image\"><figure class=\"aligncenter\"><img loading=\"lazy\" decoding=\"async\" width=\"465\" height=\"244\" src=\"https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-64.png\" alt=\"\" class=\"wp-image-635\" srcset=\"https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-64.png 465w, https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-64-300x157.png 300w\" sizes=\"auto, (max-width: 465px) 100vw, 465px\" \/><\/figure><\/div>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Ensemble modelling: &nbsp;&nbsp;<\/strong>we will be using Voting classifier in ensemble modeling , the two algorithms which got high accuracy before will be used in this \u2013 Linear SVM and Logistic Regression.<\/p>\n\n\n\n<figure class=\"wp-block-image\"><img loading=\"lazy\" decoding=\"async\" width=\"655\" height=\"156\" src=\"https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-65.png\" alt=\"\" class=\"wp-image-636\" srcset=\"https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-65.png 655w, https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-65-300x71.png 300w\" sizes=\"auto, (max-width: 655px) 100vw, 655px\" \/><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">While using\nensemble model, we couldn\u2019t get much better accuracy than single classifier, so\nthis method will not be used further.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Best model:<\/strong><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">As we have tried all the algorithms and method that we have mentioned above , best accuracy is got from Logistic Regression using K-Fold cross-validation.<\/p>\n\n\n\n<div class=\"wp-block-image\"><figure class=\"aligncenter\"><img loading=\"lazy\" decoding=\"async\" width=\"624\" height=\"154\" src=\"https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-66.png\" alt=\"\" class=\"wp-image-637\" srcset=\"https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-66.png 624w, https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-66-300x74.png 300w\" sizes=\"auto, (max-width: 624px) 100vw, 624px\" \/><\/figure><\/div>\n\n\n\n<p class=\"wp-block-paragraph\">&nbsp;&nbsp;&nbsp; <strong>TP<\/strong> : 439 ; <strong>TN<\/strong> : 150 ; <strong>FP<\/strong> : 61 ; <strong>FN<\/strong> : 118 ;<\/p>\n\n\n\n<div class=\"wp-block-image\"><figure class=\"aligncenter\"><img loading=\"lazy\" decoding=\"async\" width=\"405\" height=\"282\" src=\"https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-67.png\" alt=\"\" class=\"wp-image-638\" srcset=\"https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-67.png 405w, https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-67-300x209.png 300w, https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-67-320x224.png 320w\" sizes=\"auto, (max-width: 405px) 100vw, 405px\" \/><\/figure><\/div>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>AUC \u2013 ROC curve: <\/strong><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">AUC\u2013ROC curve is the model selection metric for bi\u2013multi-class classification problem. ROC is a probability curve for different classes. ROC tells us how good the model is for distinguishing the given classes, in terms of the predicted probability. A typical ROC curve has False Positive Rate (FPR) on the X-axis and True Positive Rate (TPR) on the Y-axis. The area covered by the curve is the area between the orange line (ROC) and the axis. This area covered is AUC. The bigger the area covered, the better the machine learning models is at distinguishing the given classes. Ideal value for AUC is 1.<\/p>\n\n\n\n<div class=\"wp-block-image\"><figure class=\"aligncenter\"><img loading=\"lazy\" decoding=\"async\" width=\"442\" height=\"292\" src=\"https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-68.png\" alt=\"\" class=\"wp-image-639\" srcset=\"https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-68.png 442w, https:\/\/guires.uk\/newsroom\/wp-content\/uploads\/2019\/11\/image-68-300x198.png 300w\" sizes=\"auto, (max-width: 442px) 100vw, 442px\" \/><\/figure><\/div>\n\n\n\n<p class=\"wp-block-paragraph\">Further\nproceedings: <\/p>\n\n\n\n<ul class=\"wp-block-list\"><li>This dataset is small\nand did not capture features well. Therefore, there is a need for more data in\norder to train the model to gain rich insights on the features.<ul><li>We can add few more\nfeatures which can be useful in predictions, new features should be added with\nconsent of Doctors which actually affects the outcome accurately. &nbsp;<\/li><\/ul><\/li><\/ul>\n\n\n\n<h3 class=\"h3color\"> Why Guires<\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">Guires\nData analytics mission is to democratize AI for healthcare industries. The team\nof data science expert use the power of AI to solve business and social\nchallenges.&nbsp; We are a pioneer in the\nresearch field for more than fifteen years and offer end to end solution for\nthe firm to set the direction for the company and support analytical frameworks\nfor better understanding and making strategic decisions. We provide appropriate\nsolutions using your existing volume of data available in varying degree of\ncomplexities that cannot be processed using traditional technologies,\nprocessing methods, or any commercial off the shelf solutions. By outsourcing\nbig data to us, we can analyze events that have happened within and outside an\norganization and correlate those to provide near accurate insights into what\ndrove the outcome. Our big data analytics solutions are fast, scalable and\npossess flexible processing.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">We use powerful algorithms, business rules, and statistical models.&nbsp; We work with text, image, audio, video and\nmachine data. Our medical experts understand the different layers of data being\nintegrated and what granularity levels of integration can be completed to\ncreate the holistic picture. Our team creates the foundational structure for\nanalytics and visualization of the data. Our data analytics team is well\nequipped with advanced mathematical degrees, statisticians with multiple\nspecialist degrees who can apply cutting-edge data mining techniques thereby\nenabling our clients to gain rich insights into existing customers and unearth\nhigh potential prospects.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">How\ncan you make the most of predictive analytics? Let us help you get started.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Get\npredictive analytics working for you. Contact Guires expert.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">How\ncan you make the most of predictive analytics? Let us help you get started.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Get\npredictive analytics working for you. Contact Guires expert.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>The Challenges Diabetes is a group of metabolic disorders in which there are high blood sugar levels over a prolonged period. Symptoms of high blood sugar include frequent urination, increased thirst, and increased hunger. If left untreated, diabetes can cause many complications. Acute complications can include diabetic ketoacidosis, hyperosmolar hyperglycemic state, or death. Serious long-term [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":624,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[55],"tags":[],"class_list":["post-619","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-use-cases"],"_links":{"self":[{"href":"https:\/\/guires.uk\/newsroom\/wp-json\/wp\/v2\/posts\/619","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/guires.uk\/newsroom\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/guires.uk\/newsroom\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/guires.uk\/newsroom\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/guires.uk\/newsroom\/wp-json\/wp\/v2\/comments?post=619"}],"version-history":[{"count":2,"href":"https:\/\/guires.uk\/newsroom\/wp-json\/wp\/v2\/posts\/619\/revisions"}],"predecessor-version":[{"id":760,"href":"https:\/\/guires.uk\/newsroom\/wp-json\/wp\/v2\/posts\/619\/revisions\/760"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/guires.uk\/newsroom\/wp-json\/wp\/v2\/media\/624"}],"wp:attachment":[{"href":"https:\/\/guires.uk\/newsroom\/wp-json\/wp\/v2\/media?parent=619"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/guires.uk\/newsroom\/wp-json\/wp\/v2\/categories?post=619"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/guires.uk\/newsroom\/wp-json\/wp\/v2\/tags?post=619"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}