Job description
We are looking for a highly motivated Data Scientist with a strong foundation in machine learning (supervised and unsupervised), deep learning, LLMs, and feature engineering, who also brings hands-on data engineering expertise using Spark, PySpark, and Python. This hybrid role offers the opportunity to work across the full data pipeline — from data ingestion and transformation to model development and deployment — supporting high-impact business use cases.
Role and Accountabilities:
Build, evaluate, and deploy supervised and unsupervised machine learning models.
Perform feature extraction, selection, and engineering for large-scale datasets.
Work with deep learning architectures (CNNs, RNNs, Transformers) and fine-tune LLMs for specific business problems like classification, summarization, or retrieval tasks.
Conduct data analysis and visualization to derive actionable insights.
Collaborate with business and product teams to translate requirements into data science problems.
Design and implement ETL/ELT pipelines using Spark/PySpark for structured and unstructured data.
Handle large-scale data processing in distributed environments.
Build reusable, modular, and scalable data processing components.
Write efficient and clean Python code for data manipulation and transformation.
Ensure data quality, lineage, and governance throughout the pipeline.
Assist in deploying models into production environments (batch or real-time).
Participate in model monitoring and performance tracking
Skills
Experience:
Strong in Python, especially for ML and data engineering.
Proficient in Spark and PySpark (batch or streaming).
Experience with scikit–learn, XGBoost, LightGBM, or similar libraries.
Experiane with Git, Jupyter, Airflow (preferred), MLflow or similar tracking tools.
Experience with Microsoft Azure
5-7 years of professional experience in data science and data engineering roles.
Exposure to real-world deployment of ML models
Understanding of data warehousing concepts and data lakes.
Familiarity with ITIL processes and frameworks.
Excellent problem-solving and communication skills.