Our client is a leading company in the FinTech industry, driving innovation and providing cutting-edge financial solutions globally. As part of the firm continuous growth, they are seeking a talented and motivated AI Software Engineer in Large Language Models (LLM) and Generative AI to join the dynamic team.
As an AI Software Engineer, you will play a critical role in developing and optimizing advanced algorithms and models for natural language processing and generative AI within the software applications. You will collaborate with cross-functional teams of engineers and data scientists to design and implement state-of-the-art solutions that address complex challenges in the financial domain.
Responsibilities:
Develop, optimize, and maintain large language models and generative AI algorithms to solve complex problems in the financial industry.
Design and implement scalable and efficient deep learning architectures for natural language processing.
Collaborate with data scientists and subject-matter experts to understand business requirements and translate them into technical solutions.
Conduct regular model evaluations, diagnostic testing, and performance analysis to ensure the reliability and accuracy of the algorithms.
Stay up-to-date with the latest advancements in large language models, generative AI, and related technologies to drive innovation within the team.
Work closely with software engineers to integrate the developed models and algorithms into our existing software applications.
Debug and troubleshoot issues related to the implemented AI solutions and provide timely resolutions.
Effectively document code, methodologies, and processes to facilitate knowledge sharing and maintain well-organized project repositories.
Collaborate and share insights with peers, mentor junior team members, and actively contribute to the continuous improvement of our engineering practices.
Qualifications:
Bachelor's or Master / PhD in Computer Science, Mathematics, or a related field.
Solid foundation in machine learning, deep learning, and natural language processing techniques.
Proficiency in programming languages such as Python, TensorFlow, PyTorch, or similar frameworks.
Hands-on experience working with large language models, generative AI, and recurrent neural networks.
Strong understanding of statistical modeling, data analysis, and visualization techniques.
Excellent problem-solving skills with the ability to think creatively and propose innovative solutions.
Effective communication in Chinese AND English, and collaboration skills to work effectively in a team-oriented environment.
Experience in the FinTech industry or knowledge of financial concepts is a plus.
Candidates with experience in the academic research areas would also be considered.