Nikita Pardeshi

Data Analyst @Barclays | nikitagpardesi@gmail.com

Data professional with 3+ years’ experience in ETL and visualization. Skilled in Python, SQL, and automating data solutions, with a strong ability to understand business requirements and work with cross-functional teams.

Experience

June 2023 - Present

Barclays Bank PLC

Data Analyst

AWS, Python, REST API, SQL, Tableau

• Automated infrastructure deployment with AWS CloudFormation, managed databases using RDS, and integrated Chef for configuration management, reducing deployment time by 60%.

• Set up and managed Amazon Redshift clusters and performed complex SQL querying for data analysis and reporting.

• Collaborated with cross-functional teams to provide product consumption and trend analysis, contributing to a 10% cost reduction in operations.

• Developed a custom application in Python and Flask for data pipelines, significantly reducing data ingestion and cleaning efforts.

January 2023 - May 2023

Northwestern University

Data Management Assistant

SQL, Looker, Python

• Extracted, cleaned, and stored Chicago crime data with Python and GCP BigQuery SQL, reducing latency from 65 seconds to under 2 seconds.

• Developed data dashboards with Looker and Plotly Dash, boosting decision-making for NGOs by 25%.

• Worked extensively with geospatial data and mapping in Python and Plotly.

June 2022 - August 2022

Ready Tensor

Machine Learning Engineer Intern

Python, R, AWS, Docker

• Implemented machine learning algorithms (AdaBoost, XGBoost, ElasticNet, Random Forest, SGD, SVM, ARIMA, SARIMA, Exponential Smoothing) in Python and R, with hyperparameter tuning.

• Provided users with model interpretability and explainability features using InterpretML in Python.

• Developed Docker images for real-time model training and testing, deploying models on AWS with SageMaker, Lambda, and EC2.

January 2021 - July 2021

SkyQuest Technology Group

Data Science Executive

NLP, Python

• Built an R&D dashboard for Reckitt, covering data collection, API creation in Flask and Django, database design and management, and custom BERT models for automated categorization, with a focus on news and text analytics (natural language processing).

• Built market-sizing forecast models in Python and performed social media sentiment analysis.

• Hands-on experience with Atlassian Jira and Confluence for Agile Project Management.

• Performed FMCG sector analysis (region- and segment-wise), including visualizations, insights, and financial analysis in Python.

• Automated the collection, indexing, searching, summarization, text analytics, and visualization of company earnings call transcripts. Led a team of 10 data science interns.

Education

2021-2023

Georgetown University

MS in Data Science and Analytics

GPA: 3.78/4
Awarded Returning Student Scholarship
Coursework: Introduction to Data Science and Analytics, Statistics and Probability, Optimization, Big Data and Cloud Computing, Statistical Inference, Advanced Data Visualization, Natural Language Processing, Neural Networks, Money Banking and Financial Markets
Positions of Responsibility: Teaching Assistant - Machine Learning Application Deployment, Advanced Math and Stat Computing, Blockchain Technologies | Student Ambassador | CSET - Data Annotator Research Assistant

2016-2020

Thadomal Shahani Engineering College, University of Mumbai

B.E. Electronics and Telecommunication

GPA: 8.24/10
Relevant Coursework: Database Management Systems (DBMS), Big Data and Cloud Computing

Skills

Programming

Python: Data Collection - BeautifulSoup, Selenium, Scrapy; Data Cleaning and Pre-Processing - NumPy, Pandas, Pydantic, Cerberus, PySpark; Data Storage - SQL, MySQL, NoSQL, HiveQL, MongoDB, CSV, Parquet, Avro, JSON/YAML; Data Modeling - scikit-learn, PyTorch, SciPy, AutoML, TensorFlow, InterpretML; REST API/Web Services - Flask, Django; Multiprocessing; Apache Spark; Data Visualization - Plotly, Dash
R, HTML, CSS, Highcharts JS, Natural Language Processing (NLTK, BERT, Hugging Face), Bash

Frameworks

scikit-learn, NLTK, spaCy, TensorFlow, Keras, Django, Flask, Apache Spark, Spark SQL, Hadoop MapReduce, HiveQL, AutoML, SciPy, Multiprocessing, InterpretML, LIME, Shapley