TRAFFIC Western Cape

Data Scientist

RayAI Inc.

Job Description

We are seeking a highly skilled Data Scientist with deep expertise in modern AI/ML systems, including LLMs, multimodal models, fine-tuning techniques, and advanced retrieval architectures. In this role, you will design, prototype, and deploy AI-powered solutions that leverage state-of-the art language, vision, and agentic frameworks. You will work closely with engineering, product, and research professionals across the US and Europe to bring cutting edge AI capabilities into production environments.

Responsibilities
  • Design, build, and optimize LLM-powered systems using OpenAI, Anthropic, and open-source/local model families.

  • Architect and implement RAG pipelines, including hybrid search, query rewriting, prompt optimization, and reranking strategies.

  • Develop and maintain vector database infrastructures (Pinecone, Weaviate, Qdrant) for large-scale embedding storage and fast retrieval.

  • Train, evaluate, and retrain embedding models for domain-specific semantic search and knowledge retrieval.

  • Build and integrate multimodal AI solutions using OCR, CLIP, and modern vision architectures for text-image understanding.

  • Apply fine-tuning techniques (LoRA/QLoRA) to adapt foundation models to organizational datasets and specialized tasks.

  • Develop production-ready AI applications using Python, PyTorch, and modern orchestration frameworks.

  • Implement LLM orchestration with LangChain or LlamaIndex, including evaluators, tool abstractions, memory, and RAG components.

  • Establish robust evaluation frameworks to measure model performance, reduce hallucination, and ensure reliability in production.

  • Build agentic workflows using AutoGen, CrewAI, or similar frameworks to power automation and multi agent collaboration systems.

  • Stay current with research trends and apply theoretical and practical insights in Generative AI to drive innovation across the organization.

Qualifications
  • Experience in applied machine learning or data science, with at least 2 years focused specifically on LLMs or Generative AI.

  • Demonstrated experience building end to end RAG, fine tuning, or multimodal AI systems.

  • Strong proficiency in Python, PyTorch, and AI tooling ecosystems.

  • Experience deploying models at scale in production environments.

  • Strong understanding of evaluation metrics, model reliability, and safety/reduction of hallucination.

  • Familiarity with vector embeddings, vector databases, and semantic search.

  • Experience with agent frameworks such as AutoGen, CrewAI, or LangGraph-like toolkits.

  • Experience with distributed training, model optimization, quantization, or GPU acceleration.

  • Knowledge of DevOps/MLOps tooling for deploying LLM based systems.

  • Contributions to open source LLM or RAG projects.

Benefits
  • Competitive salary and performance based bonuses.

  • Fully remote, flexible work environment.

  • Modern laptop and hardware provided by us.

  • Specialized training in AI, automation, and digital productivity tools.

  • Global exposure-collaborate with top tier founders and fast growing startups.

  • Continuous learning and career growth opportunities in an international environment.

About This Role

Career insights for Data Scientists positions

Salary Benchmark
R24,300/month
R15,695 to R38,292/month
Source: WageIndicator ZAR data
Job Outlook
This career will grow rapidly in the next few years.
Common Technologies
Amazon EC2 Amazon Redshift AWS Cloud Microsoft PowerPoint C# (.NET) Perl R Apache Kafka

Job Overview

Date Posted
28 Apr 2026
Location
Western Cape, South Africa

Data Scientists Insights

Median Salary (ZAR)
R24,300/month
Job Outlook
This career will grow rapidly in the next few years.

Similar Opportunities

This page incorporates data from O_NET OnLine, courtesy of the U.S. Department of Labor, Employment and Training Administration (USDOL/ETA), under the CC BY 4.0 license. O_NET is a registered trademark of USDOL/ETA. Assessify has adapted and modified the original content. Please note that USDOL/ETA has neither reviewed nor endorsed these changes.