Bhavini Vyas

Data Engineer · Data Enthusiast
Austin, TX - 78759 · bhavini266@gmail.com ·

Analytical professional with excellent problem-solving skills and strong attention to detail. Well-rounded experience as both a Data Analyst and Software Engineer. Earned Certificate in Data Analytics and Visualization from UT – Austin in January 2020. Passionate about extracting valuable insights from data to help companies make business decisions and drive growth opportunities. Comfortable working independently and as part of a team, and able to multi-task in a fast-paced environment.

Hands-on experience with all aspects of Data Analytics/Data Engineering field that includes ETL - large data collection from variety of sources like various APIs, Data Modelling – cleaning, manipulating unorganized data, loading in AWS cloud-based platform, analyzing data, finding insights. Make predictions using different machine learning algorithms and generate summary reports and dashboards with statistical analysis/different visualization techniques and finally embedding model to web app.


Experience

DATA ENGINEER

H-E-B DIGITAL

Jan 2022 - Present

DATA ANALYST

ACCENTURE

  • Analyze current manual process surrounding Operations for optimization opportunities.
  • Built automated ETL data pipelines using Python and SQL Alchemy and scheduled tasks to run those automatically that reduced manual effort and saved time.
  • Designed, developed automated and user-friendly dynamic Tableau dashboards/reports with increasing accuracy and efficiency by 90%.
  • Extracted data from various data sources like SQL Server and Oracle with optimized query as well as implemented automated and scaled reporting/process solutions and data infrastructure improvements to meet business needs.
  • Deliver insights and business object indicators (KPI) through daily, weekly and monthly reporting and presented to stakeholders and leadership.
Technologies: Python, Pandas, Advanced SQL, SQL Alchemy, Tableau, Facebook prophet timeseries model, Advanced Excel, VBA

Jun 2020 - Jan 2022

SOFTWARE ENGINEER

BAYER PHARMACEUTICALS

  • Derived software requirements from various stakeholders such as marketing, service, system engineering and human factors.
  • Implemented high quality software using best practices such as Service Oriented Architecture and Object-Oriented Concepts.
  • Implemented injector adapter components to capture ongoing injection data and store them into the database.
  • Took initiative to develop auto upgrade feature to satisfy clients' needs which conducted version check, dependency upgrade, backup and restore of configuration that reduced 30% of installation time and eliminated error prone manual installation process and was able to develop warm client relationships.
  • Transformed business needs into analytical tasks and analyzed the system logs to predict preventive maintenance which helped in reducing the labor and service cost as well as improved customer satisfaction. Documented valuable insights and presented to various stakeholders.
  • Wrote unit tests, integration tests.
Technologies: C#, SQL, Postgres, Python, Java Script

January 2014 - September 2016

SOFTWARE QUALITY ANALYST

BAYER PHARMACEUTICALS

  • Evaluated product level requirements by coordinating with system engineering team and designed/developed appropriate QA plan.
  • Developed software tests for software projects with every requirement tracing.
  • Monitored and tracked defects in software to keep clean track of defects and improve software quality.
  • Created documentation for QA procedures and reported results of testing to quality and software team to eliminate software defects.

August 2013 - January 2014

Education

University of Texas, Austin-TX

Certificate - Data Analytics and Visualization
Accelerated program focused on gaining technical programming skills in Advanced Excel, VBA, Python, JavaScript, SQL Databases, Tableau, Spark, Big Data and Machine Learning-all about data analytics/science.

Grade: A+

July 2019 - Jan 2020

SAURASHTRA UNIVERSITY, INDIA

BECHLORE OF ENGINEERING
Electronics and Communications Engineering

GPA: 3.92


Skills

Programming Languages and Libraries

  • Python
  • Pandas
  • Scikit-Learn
  • Matplotlib
  • Seaborn
  • PySpark
  • SQL
  • VBA-Macros
  • C#

Visualization and Reporting

  • Tableau
  • JS-D3
  • Plotly
  • Leaflet
  • Advanced Excel

Databases, Datawarehouse, BigData and cloud

  • Postgres
  • MySQL
  • AWS-Redshift
  • Apache Airflow
  • SQL Server
  • SSIS
  • Apache Cassandra
  • MongoDB
  • Snowflake
  • Spark
  • DataBricks
  • MS-Access


Machine / Deep Learning - NLP

  • Linear/Logistic Regression
  • Support Vector Machine (SVM)
  • Random Forest
  • KNN
  • NLTK
  • JohnSnow

Web Technology and Framework

  • Java Script
  • Flask
  • SQL Alchemy
  • HTML
  • CSS
  • JSON
  • Bootstrap
  • Beautiful Soup
  • ASP.NET
  • WCF

Others

  • Statistical Analysis
  • Jupiter Notebook
  • Google Colab
  • Visual Studio Code
  • Github
  • Mercurial
  • Jira
  • Doors


Projects

Card image

Sparkify-Data WareHouse AWS-Redshift(ETL)

An implementation of a Data Warehouse leveraging AWS RedShift. This project builds an ETL pipeline for the database hosted on AWS Redshift that extracts their data from multiple JSON files residing in S3 buckets, stages them in Redshift, and transforms data into a set of dimensional tables for their analytics team to continue finding insights in what songs their users are listening to.


Technologies: AWS-Redshift, S3, SQL, Python, Pandas

Git Hub
Card image

TRAFFIC ACCIDENT SEVERITY PREDICTION

This project is used to predict US traffic accidents severity based on weather conditions like visibility, temperature, and weather categories as well as based on US location and the day of the week. This project can be used for numerous applications such as real-time accident severity prediction based on environment factors. Studying accident hotspot locations and their severity.


Technologies: PySpark, Machine Learning, AWS-Postgres, Databricks

Git Hub
Card image

COMMUNITY HEALTH STATUS INDICATORS CORRELATIONS

Heroku deployed web app with interactive visualization to get insight of correlations between various Risk Factors(Obesity, High Blood Pressure, Diabetes), Demographic status and Access to care.


Technologies: Tableau, JS-D3, Python, Pandas, Flask, AWS-Postgres, SQL Alchemy

Git Hub Live Website

Card image

Weather Analysis

Analysed the weather as we go towards equator. Extracted weather data from OpenWeatherMap API. Displyed results in website.


Technologies: Python, Pandas, Web API, Beautiful Soup, Bootstrap, JavaScript

Git Hub Website
Card image

CitiBike Ridership Analysis

Generated interactive reports that can improve the city program (largest bike sharing program in the United States) using Tableau with use of filter, calculated fields, set, group, map layers. Analysis has been done by age, gender, season, trip duration, user type, bike usage over time, % change in ridership.


Technologies: Tableau, Python, Pandas

Git Hub Tableau Pub.
Card image

Mission To Mars

Extract necessary information by scraping NASA's various websites, transformed and saved in MongoDB. Finally displayed to webpage.


Technologies: MongoDB, Python, PyMongo, ETL-Web Scraping, Flask, Beautiful soup

Git Hub

Online Learning / Certifications

  • AWS Cloud Quest - Data Analytics
  • Udacity Data Engineering Nano-Degree Program.
  • Certification - Data Analytics And Visulization, UT Austin, TX
  • Advanced SQL for Data Scientists - LinkedIn Learning
  • Machine Learning and AI Foundations: Predictive Modeling Strategy at Scale - LinkedIn Learning
  • Essential Math for Machine Learning: Python Edition - LinkedIn Learning
  • Snowflake Cloud Datawarehouse Fundamentals - Udemy
  • Microsoft Certfied Technology Specialist (MCTS)