
Senior Data Engineer

₹75000-150000 INR

Closed
Posted over 2 years ago

Paid on delivery
Senior Data Engineer

Technical Skills
Languages – Python, SQL, Java, HCL, HTML/CSS/JavaScript, Bash
Database Technology – Spark, Sybase IQ, DB2, Snowflake, Redshift, Hive, Presto, Oracle PL/SQL
Tools – AWS, Terraform, Kubernetes, Docker, Jupyter, IntelliJ, vim, Git, SVN, Apache, nginx, Splunk, SSH

· Should primarily have worked on the Data Lake, a petabyte-scale data warehouse built for Goldman Sachs’ unique requirements. The lake is used by hundreds of teams for many time-sensitive, critical applications.
· Derived a variety of SLOs and health indicators for the lake. Successfully optimized the lake, bringing ingestion time down to under 15 minutes for more than 90% of users.
· Designed an event-driven, near-real-time SLO monitor for the lake that processes millions of events a minute.
· Crafted Terraform AWS configurations from scratch to deploy key lake components to the cloud.
· Developed and maintained a Jupyter notebook ecosystem on Kubernetes to support the SRE team.
· Wrote Jupyter notebooks to analyze telemetry metrics, develop insights, and establish SLOs. Notebooks typically pulled in data using SQL or PySpark and processed it further in pandas. Visualizations were done using matplotlib.
· Designed an automation framework for Jupyter notebooks to schedule, cache, serve, and email them to clients.
· Implemented and maintained Prometheus metrics for high-level monitoring of the lake. These metrics are pushed to Grafana for visualization and PagerDuty for alerting.
· Developed on Facebook’s Hadoop system through Hive and Presto, using Facebook’s internal ETL framework.
· Maintained solutions with third parties for ad data ingestion and delivery, including coordination of data definitions and validation checks during the ETL process.
· Created APIs using Hack (PHP) for upload endpoints.
· Developed dashboards for sales lift data normalized across third parties, using Tableau and internal tools.
· Maintained ETL processes: fixed bugs and data quality issues, optimized CPU and space usage, and added columns to tables. These were mainly core ad metrics data sets with wide impact across the company.
· Developed the Facebook status tables, a dataset exceeding 150 TB and 1.2 trillion rows, extracted from Facebook’s graph structure and curated into an easily digestible Hive table used by research teams for insights, sentiment analysis, and machine learning applications.
Project ID: 31572045

About the project

2 proposals
Remote project
Active 2 yrs ago

2 freelancers are bidding on average ₹112,500 INR for this job
Hi, we at Tecogno Solutions are a team of passionate data science and full-stack professionals with more than five years of combined experience in multiple areas, including backend, frontend, machine learning (ML), computer vision (CV), and artificial intelligence (AI). We have developed and delivered multiple similar machine learning projects. We have a strong command of Python, Flask, Django, Dialogflow, Rasa, OpenCV, Google Vision API, Tesseract, spaCy, PHP, MySQL, software architecture, TensorFlow, Twilio, Node.js, RESTful APIs, NLP, CNNs, AWS, GCP, Google APIs, etc. We are therefore capable of fulfilling all your requirements. For further information, please refer to our profile or visit our website. Looking forward to working with you. Thanks
₹150,000 INR in 31 days
4.6 (3 reviews)
5.1

About the client

Mysore, India
0.0
0
Member since Sep 21, 2021

Client Verification

Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)