TRAFFIC Western Cape

Data Engineer

Talent Safari

Job Description

About the Role

Data engineering at Paystack focuses on building and extending platforms for managing data at scale. This involves data ingestion, processing, storage, and egress. Data engineers are also responsible for creating and maintaining the infrastructure on which our data platforms run.

Data engineers operate across a diverse tech stack. They are expected to be adaptable and drawn to learning new skills and technologies.

The role requires a proactive individual who can work independently and collaboratively within a remote first environment, has a strong software engineering background with good experience building and maintaining data pipelines, expertise in Python, and experience in streaming technologies.

What You Will Do
  • Data Pipeline Development: Design, develop, and maintain robust data pipelines using ETL and ELT methodologies to process and integrate data from various sources into a data lake, a central data warehouse, operational data stores, analytical data marts, and various application interfaces.

  • Streaming Data Processing: Implement and manage real time data streaming solutions utilising Kafka, Debezium, and Kafka Connect.

  • Workflow Orchestration: Build, schedule, and maintain custom workflows using Apache Airflow to ensure timely and accurate data processing and delivery.

  • Database Management: Work with a variety of database technologies, including relational databases (MySQL, PostgreSQL), NoSQL databases (MongoDB), and analytical/big data systems (Redshift, BigQuery, SingleStore).

  • Infrastructure as Code: Employ tools like Terraform, Kubernetes, and Helm to manage and provision infrastructure efficiently.

  • CI/CD Implementation: Develop and maintain continuous integration and deployment pipelines to streamline development processes.

  • Testing and Quality Assurance: Conduct unit and integration testing to ensure high code quality, data integrity, and system reliability.

  • Collaboration: Engage with cross functional teams, including data scientists, analysts, and business stakeholders, to understand data needs and deliver solutions.

  • Documentation: Maintain clear and comprehensive documentation of data processes, workflows, and systems.

  • Monitoring and Support: Monitoring system performance and addressing faults and failures in production systems as part of an on call rotation.

What You Have
  • Educational Background: Bachelor's degree in Computer Science, Engineering, or a related field.

  • Programming Skills: Proficiency in Python is essential. Knowledge of JavaScript and Scala is advantageous.

  • Data Engineering Experience: Minimum of 3 years of experience in data engineering roles, with a focus on building and managing data pipelines.

  • Software Engineering Experience: Minimum of 2 years of experience in software and/or application development roles (can be concurrent with data engineering experience).

  • Streaming Technologies: Hands on experience with Kafka, Debezium, and Kafka Connect.

  • Data pipeline orchestration tools: Proficiency in a data pipeline orchestration tool or suitable workflow orchestration tool like Apache Airflow (preferred), Databricks, Dagster, or Airbyte.

  • Database Expertise: Strong understanding and hands on experience working with various database technologies, including MySQL, PostgreSQL, MongoDB, and Redshift (BigQuery and SingleStore advantageous).

  • Infrastructure Tools: Experience with Terraform, Kubernetes, and Helm for infrastructure management.

  • Cloud Computing: Solid knowledge of cloud computing concepts, with experience in AWS services being advantageous.

  • SQL Proficiency: Ability to write complex SQL queries across different dialects.

  • Testing Practices: Familiarity with unit and integration testing methodologies.

  • CI/CD Pipelines: Experience in setting up and maintaining CI/CD pipelines.

  • Data Science Tools: Exposure to analytical systems and basic data science tooling. Familiarity with basic machine learning and analytical modelling concepts advantageous.

  • BI Reporting Platforms: Exposure to self service reporting tools like Tableau, Looker, and DOMO.

Who You Are
  • A strong communicator: Good verbal and written communication skills, with the ability to convey complex technical concepts to non technical stakeholders.

  • A Collaborator: Demonstrated ability to work collaboratively within a team and across departments.

  • Adaptable: Comfortable working in a fast paced environment with changing priorities, technologies, and tooling. Life long learners will do well here.

  • A Problem Solver: Strong analytical and problem solving skills.

About This Role

Career insights for Database Architects positions

Salary Benchmark
R24,300/month
R15,695 to R38,292/month
Source: WageIndicator ZAR data
Job Outlook
This career will grow rapidly in the next few years.
Key Skills for This Role
Complex Problem Solving Critical Thinking Judgment and Decision Making Reading Comprehension Systems Analysis
Common Technologies
Amazon DynamoDB Elasticsearch (now Elastic) MongoDB Atlas Apache Hive Blackboard Learn IBM Db2 Django Angular