Data Analyst @Barclays | nikitagpardesi@gmail.com
|Data professional with 3+ years’ experience in ETL and visualization. Skilled in Python, SQL, and automating data solutions with strong ability to understand business requirements and work with cross-functional teams.
Data Analyst
AWS, Python, REST API, SQL, Tableau
• Automated infrastructure deployment with AWS CloudFormation, managed databases using RDS and integrated Chef for configuration management reducing deployment time by 60%.
• Set up and managed Amazon Redshift clusters and performed complex SQL querying for data analysis and reporting.
• Collaborated with cross-functional teams to provide product consumption and trend analysis contributing to a 10% cost reduction in operations.
• Developed a custom application in Python and Flask for data pipelines, significantly reducing data ingestion and cleaning efforts.
Data Management Assistant
SQL, Looker, Python
• Extracted, cleaned, and stored Chicago crime data with Python and GCP BigQuery SQL, reducing latency from 65 seconds to under 2 seconds.
• Developed data dashboards with Microsoft Looker and Plotly DASH, boosting decision-making for NGOs by 25%.
• Worked extensively with geospatial data and mapping in Python and Plotly.
Machine Learning Engineer Intern
Python, R, AWS, Docker
• Implemented machine learning algorithms (AdaBoost, XGBoost, ElasticNet, Random Forest, SGD, SVM, ARIMA, SARIMA, Exponential Smoothing) in Python and R, with hyperparameter tuning.
• Provide users with Model interpretability and explainability features using InterpretML in Python.
• Developed Docker images for real-time model training and testing, deploying models on AWS with SageMaker, Lambda, and EC2.
Data Science Executive
NLP, Python
• Built R&D dashboard for Reckitt – data collection, creating APIs in Flask and Django, database design and management, custom BERT models for automated categorization with a focus on News and Text Analytics (Natural Language Processing).
• Market sizing forecast models in Python, Social media Sentiment analysis.
• Hands-on experience with Atlassian Jira and Confluence for Agile Project Management.
• FMCG sector analysis (region and segment wise)- visualizations through insights, financial analysis in Python.
• Automated Company Earnings Call transcript’s collection, indexing and searching, summarization, text analytics and visualization. Lead a team of 10 data science interns.
MS in Data Science and Analytics
GPA: 3.78/4
Awarded Returning Student Scholarship
Coursework: Introduction to Data Science and Analytics, Statistics and Probability,
Optimization, Big Data and Cloud Computing, Statistical Inference, Advanced Data Visualization,
Natural Language Processing, Neural Networks, Money Banking and Financial Markets
Positions of Responsibility : Teaching Assistant – Machine Learning Application
Deployment, Advanced Math and Stat Computing, Blockchain Technologies | Student Ambassador |
CSET- Data Annotator Research Assistant
B.E. Electronics and Telecommunication
GPA: 8.24/10
Relevant Coursework: Database Management System (DBMS) & Big Data and Cloud Computing
Python: Data Collection- BeautifulSoup, Selenium, Scrapy; Data Cleaning and Pre-Processing- Numpy, Pandas, Pydantic, Cerberus, PySpark; Data storage- SQL, MySQL, NoSQL, HiveQL, MongoDB, csv, Parquet, Avro, JSON/YAML; Data Modeling- sklearn, PyTorch, Scipy, AutoML, Tensorflow, InterpretML; Rest API/ Web services- Flask, Django; Multiprocessing; Apache Spark; Data Visualization- Plotly, Dash
R, HTML, CSS, HighCharts JS, Natural Language Processing (NLTK, BERT, Hugging Face), Bash
Scikit, NLTK, SpaCy, TensorFlow, Keras, Django, Flask, Apache Spark,
Spark SQL, Hadoop MapReduce, HiveQL, AutoML, Scipy, Multiprocessing, InterpretML, LIME, Shapley
AWS: EC2, S3, Cloud9, Sagemaker, EMR, Hadoop User Experience (HUE), Lambda
Kubernetes, Docker, GIT, PostgreSQL, MySQL, SQLite, Tableau, Power BI, Atlassian- Jira, Confluence, UiPath, Automation Anywhere
A data viz mini project using Python and Highcharts. View Code.
View projectAnalyzing subreddit with Apache Spark and other Big Data Tools
View project