Home

SINDHU R - Senior Data Engineer
[email protected]
Location: Iowa City, Iowa, USA
Relocation: Open (515-605-7328)
Visa: H4EAD
Resume file: Sindhu DE_1772468716895.docx
Please check the file(s) for viruses. Files are checked manually and then made available for download.
8+ years of expertise in data engineering and data science, with a focus on developing scalable end-to-end ETL/ELT pipelines that include data collecting, ingestion, transformation, modeling, integration, and analytics for structured and unstructured data sources.
Extensive hands-on experience with the Hadoop ecosystem (HDFS, MapReduce, Spark, Scala, Hive, Pig, Sqoop, Flume, Oozie, Impala, HBase, YARN) and real-time data streaming with Kafka, Storm, and Spark Streaming
Extensive expertise creating secure and scalable cloud-native data systems using AWS (EC2, S3, EMR, RDS, Redshift, Glue, Lambda, IAM, CloudWatch, SQS, SNS), Azure (ADF, Data Lake, Databricks), and GCP (Compute Engine, Cloud Storage, Cloud SQL) technologies.
Experience developing batch and real-time data pipelines in PySpark, Spark SQL, Scala, and Python, as well as orchestrating processes in Airflow, NiFi, AWS Step Functions, and Azure Data Factory.
Deep understanding of data warehousing and dimensional modeling (Star Schema, Snowflake Schema), as well as the creation of enterprise data lakes and optimized data marts for analytics and business intelligence reporting.
Practical knowledge with Snowflake (SnowSQL, Snowpipe), Amazon Redshift, and performance tuning via complicated SQL queries, stored procedures, indexing, and query optimization approaches.
Extensive expertise with NoSQL and RDBMS databases such as MongoDB, Cassandra, DynamoDB, MySQL, PostgreSQL, Oracle, and SQL Server, assuring data integrity, migration, and validation.
Used Scikit-learn, TensorFlow, Keras, PyTorch, and SageMaker to create regression, clustering, PCA, SVM, decision trees, and deep learning models for predictive analytics and business insights.
Experience with data preparation, feature engineering, exploratory data analysis (EDA), statistical modeling, and large-scale data transformations with NumPy, Pandas, and PySpark.
Created interactive dashboards and reporting solutions with Tableau, AWS QuickSight, and Data Studio, allowing business stakeholders to gain real-time insights.
Implemented CI/CD and DevOps methods using Git, Jenkins, and cloud automation tools delivered scalable data products with monitoring, logging, and performance optimization.
Strong grasp of Agile processes, as well as outstanding communication skills, for coaching team members and providing enterprise-grade, secure, and high-performance data solutions.
Keywords: continuous integration continuous deployment sthree

To remove this resume please click here or send an email from [email protected] to [email protected] with subject as "delete" (without inverted commas)
[email protected];6915
Enter the captcha code and we will send and email at [email protected]
with a link to edit / delete this resume
Captcha Image: