- Big Data Engineer/Spark Engineer at Staples (Dec 2018 – present)
- Big Data/Spark Engineer at DentaQuest (Jan 2018 – Nov 2018)
- Hadoop Engineer/Data Engineer at Credence Analytics (Jul 2016 – Nov 2017)
- Data Engineer at Quantiphi Analytics (Mar 2015 – Jun 2016)
Darshan Shah
Big Data/Hadoop/Spark/Data Engineer
Skills
- Big Data Tools: Hadoop, MapReduce, HDFS 2, Hive, Pig, HBase, Sqoop, Spark, Kafka
- OLAP & ETL Tools: Tableau, Spyder, Spark, SSIS, Informatica PowerCenter
- Data Modelling Tools: Microsoft Visio, ER Studio, Erwin
- Python and R libraries: R: tidyr, tidyverse, dplyr, reshape, lubridate; Python: Beautiful Soup, NumPy, SciPy, Matplotlib, python-twitter, pandas, scikit-learn, Keras, TensorFlow, NLTK
- Languages: SQL, Python, R, Scala
- Data Warehouse schemas: Star Schema, Snowflake schema
- Database: MySQL, Hive, Teradata, MS Access, SQL Server, Oracle, MongoDB, PostgreSQL
- Reporting Tools: MS Excel, Tableau, Power BI, QlikView, Qlik Sense, D3, SSRS, SSIS
- Cloud Computing Tools: Amazon Web Services (AWS), Microsoft Azure, Google Cloud Platform (GCP)
- Machine Learning: Regression, Clustering, MLlib, Linear Regression, Logistic Regression, Decision Tree, SVM, Naive Bayes, KNN, K-Means, Random Forest, Gradient Boosting, AdaBoost, Neural Networks, Time Series Analysis
- Data Science Tools: Machine Learning, Deep Learning, Data Warehouse, Data Mining, Data Analysis, Big Data, Data Visualization, Data Munging, Data Modelling
- Operating Systems: Windows, Linux, macOS