Hi, I'm Eva Kaushik

Data Scientist @DXC Technology

AI & ML Architect| AWS Certified| GenAI & MLOps

Contact Me

About Me

My Introduction

I am a Data Scientist at DXC Technology, where I collaborate cross-functionally to develop AI solutions. My expertise lies in data science, with a strong proficiency in Python, R, Java/C++, SQL, AWS/Azure, and Generative AI. I excel in end-to-end CI/CD development.
In my spare time, I am passionate about reading and researching cutting-edge technologies. I have authored over eight publications in renowned journals.

42 Data Projects
Completed
58 Articles
Written
8 Published
Papers

Skills

My Technical Level

Development

All About the Core

Python

90%

Java

80%

PySpark

75%

R

70%

C++

40%

JavaScript

70%

Azure/AWS

85%

MS Excel

70%

GenAI

70%

MLOps

90%

Frameworks

Everyone Needs Support

NumPy

80%

pandas

90%

matplotlib

70%

scikit-learn

85%

Spark MLlib

70%

Pytorch

85%

Deep Graph Library

55%

OpenCV

65%

Pillow

65%

NLTK

60%

streamlit

80%

seaborn

70%

Flask

40%

Machine Learning

Theory, theory!

Linear and Logistic Regression

95%

Decision Trees

95%

Ensemble Models

90%

Clustering

65%

Convolutional Neural Networks

80%

Graph Neural Networks

60%

Recommender Systems

75%

Natural Language Processing

65%

Exploratory Data Analysis

90%

Multi-modal Learning

70%

Time Series

55%

Cloud and Engineering

Fly Fast & High!

AWS Sagemaker

65%

AWS EMR

75%

AWS Lambda

70%

Big Query

40%

Docker

60%

Apache Airflow

40%

Kafka

40%

Databases and Viz

Wow! Factor

MySQL

85%

AWS Redshift

75%

Amazon RDS

70%

Tableau

50%

Power BI

50%

Looker

60%

Qualification

My Personal Journey
Education
Work

B.Tech in Information Technology

GGSIPU, New Delhi, India
2018-2022

Higher Secondary in Science

Kendriya Vidyalaya Sainik Vihar, Delhi, India
2016-2018

Data Scientist

@Nestle (Contract with DXC Poland & India)
Feb 2024 - Present
What I did here

  • Deployed advanced AWS services and OpenAI models on Azure, optimizing autoscaling and resource management

  • Implemented 9 production-ready models using machine learning and statistical methods, addressing a $30 million use-case and enhancing predictive accuracy and performance

Data Scientist/ML Engineer

DXC Technology
Aug 2022 - Present
What I did here

  • At DXC, optimized annotation tools and GPU cost analysis reduced computational costs for computer vision projects

  • Developed an interactive PyQt5 robot and implemented GenAI-driven Digital Twin technology, reducing setup time by 50%

  • Engineered a real-time anomaly detection system with GNNs, achieving ~98.3% accuracy, and a scalable recommendation engine using Reinforcement Learning, boosting user engagement by 20%

Founding Pioneer

Dexignare
Jun 2021 - Jul 2022
What I did here

  • As Founding Pioneer at Dexignare (Jun 2021 - Jul 2022), integrated advanced data science methodologies to optimize UX/UI design, utilizing CNNs, RNNs, and GANs for enhanced user interaction

  • Developed high-precision predictive models, conducted comprehensive data analysis, and leveraged cloud platforms for scalable model deployment and real-time analytics, setting new industry benchmarks

Portfolio

My Projects

RETAIL-CHURN-AZURE-ML-STUDIO

Retail Churn Prediction using Azure ML Studio

  • Developed a predictive model for retail customer churn using Microsoft Azure Machine Learning Studio

  • Leveraging advanced data science methodologies. Key tasks included extensive data preprocessing (data cleansing, normalization, feature engineering) and implementing a Boosted Decision Tree algorithm for high-accuracy predictions

  • Utilized Python libraries (NumPy, Pandas, Scikit-learn, azureml-sdk) and integrated PowerBI for detailed visualizations and interactive dashboards. Employed cloud-based solutions for scalable model deployment, yielding actionable insights for customer retention strategies and optimizing business performance

  • Tech Stack


    View Code

    Tensorflow-Tool-Time-Series

    A Python Tool for RNN Modelling with Keras

  • Tensorlab simplifies time series data modeling using Keras and TensorFlow. It automates data preprocessing, model management, and training progress monitoring

  • Ideal for deep learning workflows, it supports CNNs, RNNs, and GANs, and integrates with Python libraries like NumPy and Pandas for efficient and high-precision results

  • Tech Stack


    Research Papers Referred

    View Code

    Cross-Platform Agile.NET

    Cross-Platform Agile Data Analytics for .NET

  • Squirrel supports datasets ranging from tiny to medium, providing tools for data acquisition, cleansing, and visualization. It aims to empower software developers to derive insights from data, bridging the gap between business users and complex data analytics. The framework is designed for ease of use, with APIs for various data operations, and is actively developed to incorporate user feedback and new features

  • Tech Stack


    View Code

    Experiment Ease

    Lightweight Framework for Machine Learning Experiments

  • It's a modular and highly configurable framework designed to streamline machine learning experiments. It supports various run environments, including processes, Docker containers, and Kubernetes pods, and integrates with multiple logging backends like MLflow, TensorBoard, and Trains

  • The framework simplifies experiment setup, execution, and monitoring, making it ideal for individual data scientists, researchers, and small teams. With features like systematic logging, parameterized experiments, and seamless integration with existing ML projects, ExperimentEase enhances productivity and efficiency in resource-constrained environments

  • Tech Stack

    View Code

    Finetuning-LLM

    A Comprehensive Guide to Finetuning Large Language Models

  • It covers the strategic use of finetuning, data preparation, instruction tuning, and training and evaluation processes. This course equips you to enhance model performance, ensuring efficient and effective use of fine-tuned language models for superior NLP project outcomes

  • Tech Stack

    View Code

    Markov-chain-Monte-Carlo-Algo

    Advanced Algorithms for Stochastic Processes

  • Provides a comprehensive collection of models and algorithms for Markov Chain Monte Carlo (MCMC) simulations

  • It includes implementations of the DWPSDE model, Ricker model, and adaptive updating algorithms, among others

  • This repository is designed to facilitate advanced research and practical applications in stochastic processes, offering utilities and tools for efficient MCMC simulations and analysis

  • Tech Stack

    View Code

    Jenkins-Codeflow

    Robust Automation Server for Continuous Integration and Delivery

  • It's a powerful automation server designed to streamline code deployment through continuous integration and delivery (CI/CD)

  • It supports various technologies, including Python, HTML, CSS, and Docker, to ensure seamless code flow and efficient deployment processes. The repository includes essential files like Dockerfile, Jenkinsfile, and deployment configurations, making it a comprehensive solution for automating and managing software development pipelines

  • Tech Stack

    View Code

    RAG-Ollama-Function

    Context-Aware Response Generation with Chroma DB and LLAMA2

  • In-Context Rag for contextual response refinement, and LLAMA2 for generating human-like responses. This system leverages function calling to interact with various components, ensuring accurate and contextually appropriate answers to user queries. It combines advanced data science techniques with large language models to enhance the efficiency and relevance of automated responses

  • Tech Stack

    View Code

    Publications

    Research Manuscripts

    Genetically Induced Biomaterial Advances in Medical Sciences(2023/pp 95-123)

    Protein-based monomers are among the most alluring options for achieving improved results with cutting-edge biomaterials since recent advances in bioartificial and microbiology technologies enable the very complicated, accurate design, and fabrication of protein-based biomaterials...

    Visit!

    Postmortem Concentrations-Distributed privacy-preserving blockchain authentication framework in cloud forensics(2024/Chapter 8/pp 161-184)

    With the development of technology, every industry has partnered with blockchain to deliver the greatest results, including healthcare, education, and finance. However, one of the most crucial sectors, namely, forensic sciences, is still behind...

    Visit!

    Cardiovascular disease prediction using machine learning techniques(Volume 2587, Issue 1)

    Now a days, healthcare plays a vital purpose in serving humanity. We should focus on health that aims at ensuring the highest attainable level of well-being. Furthermore, their equitable division centered on people's needs, disease prevention to treatment, restoration, and palliative care...

    Visit!

    Dynamics of EHR in M‐Healthcare Application(295-309/2023/Chapter 11)

    Health systems and policies play a critical role in defining how health services are provided, run, and impact health developments...

    Visit!

    Semantic-based approach for medical cyber-physical system (MCPS) with biometric authentication for secured privacy(pg. 237)

    Medical cyber-physical systems (MCPS), which seamlessly combine medical equipment, information systems, and physical settings, have been developed as a result of the rapid technological advancement that has transformed healthcare...

    Visit!

    Certifications

    Extra Courses I have Undertaken

    AWS Cloud Practitioner

    Issue Date: Mar 14, 2024

    View Certificate

    Dell Boomi Associate Administrator

    Issue Date: June 13, 2023

    View Certificate

    GSI NVIDIA Technologies Curriculum(2024)

    Issue Date: July 5, 2024

    View Certificate

    AWS Migration Hub Primer

    Issue Date: July 17 2024

    View Certificate

    Blog

    My Technical Articles

    OpenAI’s Whisper: Speech Recognition

    GenAI is bolstering Tech!

    Read it!

    Understanding Provisioned Throughput Units (PTUs) in Azure OpenAI

    PTU's are the next big thing!

    Read it!

    Evolution of Profanity Filters in Digital Communication

    Everyone knows it, but do you know it's Application?

    Read it!

    Advanced-Data Intelligence: Deploying Grafana with Azure’s Cutting-Edge Features

    The pandemic accelerated AI adoption — and made Big Tech richer — but did AI adoption happen in the places where it was needed?

    Read it!

    Contact Me

    Get in Touch

    Call Me

    +91-9667841697

    Location

    India