Must Have Skills
Primary Skills
Strong SQL query writing skills with the ability to write and optimize complex queries.
Proficiency in at least one programming language: Python, Scala, or Java.
Basic understanding of Apache Spark and Big Data processing concepts.
Solid knowledge of SQL and NoSQL database concepts.
Experience with software engineering methodologies and lifecycle management tools such as JIRA.
Hands-on experience with code repositories and CI/CD pipelines.
Fundamental cloud knowledge in AWS, GCP, or Azure.
Secondary Skills
NoSQL data modeling
AWS services such as S3, Glue, EMR
Experience with DBT, Airflow, Flink, Kafka
Experience in developing and managing CI/CD pipelines
Nice to Have / Added Advantage
Knowledge or experience in LLM-based code conversion
Snowflake expertise
Familiarity with Agentic AI, RAG, and vector databases (client is exploring Agentic AI for traditional code conversion to a lakehouse architecture)