Big Data Engineer (The Data Pipeline Innovator)

Company: Unreal Gigs
Location: San Francisco
Posted on: November 11, 2024

Job Description:

Are you passionate about handling massive datasets and building the infrastructure that enables complex data analysis and machine learning at scale? Do you excel in creating robust, scalable data pipelines that fuel data-driven decision-making? If you're ready to tackle the challenges of big data, our client has the perfect role for you. We're seeking a Big Data Engineer (aka The Data Pipeline Innovator) to architect and maintain high-performance data systems that empower analytics and support advanced data processing needs.As a Big Data Engineer at our client, you'll collaborate with data scientists, analysts, and software engineers to design, implement, and optimize big data platforms. Your expertise in data engineering, distributed systems, and cloud infrastructure will be critical to ensuring that our data ecosystem is efficient, reliable, and scalable.Key Responsibilities:

Design and Build Scalable Data Pipelines:

Architect and implement data pipelines for ETL processes using tools like Apache Spark, Kafka, and Hadoop. You'll create data workflows that handle high-volume, high-velocity data and ensure seamless integration across systems.
Optimize Big Data Storage and Processing:
- Develop and manage data storage solutions (e.g., HDFS, S3, Cassandra) that are optimized for performance and cost-efficiency. You'll configure distributed processing systems to support efficient data retrieval and transformation.
- Collaborate on Data Strategy and Integration:
  - Work closely with data scientists, analysts, and other engineers to align big data architecture with analytics goals. You'll ensure data availability and integrity across systems to support business objectives.
  - Implement Data Quality and Governance Standards:
    - Develop processes and tools to monitor data quality and enforce data governance policies. You'll ensure data is accurate, reliable, and secure through regular checks and validation processes.
    - Enhance Data Processing with Automation:
      - Use tools like Apache Airflow or AWS Glue to automate data workflows and reduce manual processing. You'll implement scripts and automation that streamline data handling and improve efficiency.
      - Monitor and Troubleshoot Data Systems:
        
        Use monitoring tools to track system performance and address issues proactively. You'll troubleshoot and resolve any bottlenecks or failures to maintain optimal data processing capabilities.
        
        Stay Updated on Big Data Trends and Technologies:
        
        Keep up with advancements in big data technologies and tools. You'll integrate new techniques and platforms that align with business needs and promote innovation. Required Skills:
        
        Big Data Platform Proficiency: Extensive experience with big data technologies such as Apache Spark, Hadoop, Kafka, and Hive. You're skilled at handling high-volume data and distributed processing.
        
        Data Pipeline and ETL Knowledge: Proven ability to design, build, and maintain ETL processes for massive datasets. You can handle both real-time and batch data processing requirements.
        
        Programming and Scripting: Proficiency in programming languages like Python, Java, or Scala for data processing and automation. Experience with SQL for data querying and manipulation is essential.
        
        Cloud Data Services Expertise: Familiarity with cloud platforms such as AWS, GCP, or Azure, including their big data and storage services (e.g., S3, BigQuery, Azure Data Lake).
        
        Data Quality and Governance: Strong understanding of data quality standards and governance practices, with experience in implementing data validation and monitoring frameworks. Educational Requirements:
        
        Bachelor's or Master's degree in Computer Science, Data Engineering, Information Technology, or a related field. Equivalent experience in data engineering or big data management may be considered.
        
        Certifications in big data or cloud technologies (e.g., Cloudera Certified Data Engineer, AWS Certified Big Data - Specialty, Google Professional Data Engineer) are a plus. Experience Requirements:
        
        5+ years of experience in data engineering, with at least 3+ years focusing on big data technologies and high-scale data environments.
        
        Experience in distributed systems and large-scale data storage management.
        
        Familiarity with containerization (Docker, Kubernetes) for deploying data processing environments is advantageous.
        
        Health and Wellness: Comprehensive medical, dental, and vision insurance plans with low co-pays and premiums.
        
        Paid Time Off: Competitive vacation, sick leave, and 20 paid holidays per year.
        
        Work-Life Balance: Flexible work schedules and telecommuting options.
        
        Professional Development: Opportunities for training, certification reimbursement, and career advancement programs.
        
        Wellness Programs: Access to wellness programs, including gym memberships, health screenings, and mental health resources.
        
        Life and Disability Insurance: Life insurance and short-term/long-term disability coverage.
        
        Employee Assistance Program (EAP): Confidential counseling and support services for personal and professional challenges.
        
        Tuition Reimbursement: Financial assistance for continuing education and professional development.
        
        Community Engagement: Opportunities to participate in community service and volunteer activities.
        
        Recognition Programs: Employee recognition programs to celebrate achievements and milestones.
        #J-18808-Ljbffr

Keywords: Unreal Gigs, Dublin , Big Data Engineer (The Data Pipeline Innovator), Engineering , San Francisco, California

Click here to apply!

Didn't find what you're looking for? Search again!

Let San Francisco recruiters find you. Post your resume for free!

Get San Francisco Engineering jobs via email.

View more Dublin Engineering jobs

Other Engineering Jobs

Staff Machine Learning Engineer
Description: EvenUp is on a mission to support injury law firms across America in providing a consistent and high standard of representation, ensuring that every injury victim who seeks legal assistance can expect (more...)
Company: EvenUp
Location: San Francisco
Posted on: 11/15/2024

Senior Reliability Engineer
Description: The Company br SPAN develops products that accelerate the rapid adoption of renewable energy in the home. The flagship SPAN Smart Panel is the first true evolution for the traditional home electric (more...)
Company: Span
Location: San Francisco
Posted on: 11/15/2024

Senior Natural Language Processing (NLP) Engineer
Description: Company Overview: Welcome to the forefront of Natural Language Processing NLP innovation Our company is dedicated to harnessing the power of NLP to transform industries and revolutionize how people (more...)
Company: Unreal Gigs
Location: San Francisco
Posted on: 11/15/2024

Salary in Dublin, California Area | More details for Dublin, California Jobs |Salary

Renewable Energy Engineer (The Sustainable Power Pioneer)
Description: Do you have a passion for driving the transition to clean, renewable energy sources Are you excited about designing innovative systems that harness the power of wind, solar, geothermal, or hydropower (more...)
Company: Unreal Gigs
Location: San Francisco
Posted on: 11/15/2024

Solutions Engineer
Description: Job Summary: The Solutions Engineer is responsible for providing pre- and post-sales support for our rapidly growing customer base resulting in revenue growth. Is responsible for the technical relationships (more...)
Company: Menlo Ventures
Location: San Francisco
Posted on: 11/15/2024

Lead Machine Learning Engineer
Description: On any given day at Disney Entertainment ESPN Technology, we're reimagining ways to create magical viewing experiences for the world's most beloved stories while also transforming Disney's media business (more...)
Company: Disney Entertainment & ESPN Technology
Location: San Francisco
Posted on: 11/15/2024

Ground RF Hardware Engineer - Associate (Spring 2025) San Francisco
Description: Ground RF Hardware Engineer - Associate Spring 2025 Astranis is on a mission to bridge the digital divide by connecting the four billion people worldwide who currently lack internet access. We're doing (more...)
Company: Tbwa Chiat/Day Inc
Location: San Francisco
Posted on: 11/15/2024

Engineering Manager
Description: About Decagon:Decagon is building the most advanced conversational AI agents for the enterprise. Since starting the company, we've been on a tear, winning over customers like Duolingo, Notion, Rippling, (more...)
Company: Decagon AI, Inc.
Location: San Francisco
Posted on: 11/15/2024

Support Engineer - 2nd Shift - 4pm to 12am pst
Description: Bad software is everywhere, and we're tired of it. Sentry is on a mission to help developers write better software faster, so we can get back to enjoying technology.With more than 217 million in funding (more...)
Company: Sentry
Location: San Francisco
Posted on: 11/15/2024

Senior QA Engineer
Description: POSTED August 5, 2024 San Francisco , California , United States 135,000 - 170,000 On-Site Full Time About the positionAt Kandji, we use an embedded team model for our QA Engineers. You
Company: Sage Valley Senior Living
Location: San Francisco
Posted on: 11/15/2024

Loading more jobs...

Big Data Engineer (The Data Pipeline Innovator)

Didn't find what you're looking for? Search again!

Other Engineering Jobs

Log In or Create An Account