Job Title:
Python Developer- PySpark
Location:
Toronto, ON (Hybrid-3 days onsite)
Job Description/ Responsibilities:
Role Overview
We are seeking a skilled and motivated SAS to PySpark Developer to lead the migration of legacy SAS-based analytics workflows to scalable PySpark solutions. This role is critical to modernizing our data infrastructure and enabling advanced analytics capabilities across cloud platforms.
Key Responsibilities
Analyze and reverse engineer existing SAS programs, macros, and data flows.
Design and implement equivalent PySpark modules optimized for performance and scalability.
Develop and maintain ETL pipelines using PySpark integrated with AWS, Snowflake/DataBricks
Validate migrated outputs against SAS baselines to ensure accuracy and consistency.
Collaborate with cross-functional teams including data scientists, business analysts, and cloud engineers.
Document technical specifications, migration strategies, and validation protocols.
Participate in sprint planning, estimation, and delivery of migration milestones.
Required Qualifications
Bachelor’s degree in Computer Science, Engineering, or related field.
3+ years of experience in SAS programming and data engineering.
2+ years of hands-on experience with PySpark and big data technologies.
Strong understanding of data structures, algorithms, and performance tuning.
Experience with cloud platforms (AWS, Azure, Snowflake) and distributed computing.