Developing AI-driven automation and next generation AI tools.
Strong understanding of data center network support and NOC operations, AI engineering, and a passion for developing innovative solutions to complex business challenges.
Contribute toward AI roadmap for AI-driven automation aligning with strategic company and client goals.
Work with engineering and data teams to design and architect scalable, robust, and innovative AI solutions (e.g., automated network diagnostics, bot recommendation systems, AI agents NOC operations).
Take on Solution Owner role in an Agile/Scrum environment, managing the product backlog, writing detailed user stories, defining acceptance criteria, and prioritizing features.
Drive evaluation, fine-tuning, and integration of open-source Large Language Models (LLMs) like Llama series \& other LLMs.
Develop and ownership of key components of the automation framework, including PowerShell and Python runbook executors.
Develop and train ML models for tasks such as anomaly detection, predictive maintenance, and capacity planning in IT environments.
Work with large datasets of IT operational data, performing data cleaning, feature engineering, and data analysis to improve model accuracy and performance.
Contribute to the development of internal automation tools and frameworks.
Deliver continuous service improvements by proactively identifying opportunities for process enhancements.
Develop AI/Gen AI point solutions to meet business needs.
Resolve and troubleshoot issues related to automation systems and AI models.
Collaborate with customers, IT operators, Network Engineers, and internal stakeholders to gather requirements, validate solutions, and ensure product-market fit.
Subject Matter Expert on AIOps, IT Process Automation (ITPA), and Runbook Automation (RBA), providing technical guidance and insights.
Documentation of automation processes, code, and models clearly and concisely.
Contribute to team results to achieve team goals and objectives.
Take on skills, knowledge development and training of team members.
Requirements
Around 4+ years of experience in IT services, with at least 3 years in:
IT Service Automation – Orchestration, Scripting, \& Process Assessment.
Work experience in AI Engineering; Develop and deploy AI Agents and cutting-edge GenAI \& ML solutions to address complex business challenges.
Strong knowledge to integrate various tools and building analytics/insights.
Verse with at least two scripting languages such as PowerShell, Python, or Shell Script.
Strong foundation across core IT domains, with a specific emphasis on Network \& Network Services, including understanding network topologies, protocols, and common operational issues.
Competence at implementing solutions using state-of-the-art LLMs, with hands-on experience with both open-source models (e.g., Llama series) and proprietary models (e.g., GPT-4).
Verse with major deep learning framework, preferably PyTorch, for model experimentation and fine-tuning.
Verse with high-level programming language (Java, Python) and experience with containerization technologies (Docker, Kubernetes) for deploying AI models.
Solid understanding and practical experience with cloud platforms such as Azure or GCP, including knowledge of their networking services (e.g., VNets, VPCs).
Work experience with AI/ML libraries and frameworks.
Sound knowledge of ITIL process on one or more service lifecycle or service capability modules.
Good Knowledge in one or more System administrative activities like monitoring, service requests, incident management, change management, \& maintenance.
Ability to converse and explain concepts and solutions clearly and concisely.
Analytical and problem-solving skills.
Mentorship experience
Kindly understand that only shortlisted candidates will be notified.
WeLead Solutions Pte Ltd
Poon Wai Soon, Bernard
Reg. No: R2197713
EA License No.: 23C1882