HPC / MLOps Infrastructure Engineer – Insight Softmax Consulting team

Together with our partner, Insight Softmax Consulting, we are looking for experiencedi nfrastructure engineer to work for the prominent client from the aerospace industry. Please note that this can be seen as a single position, or as two separate positions HPC Infrastructure Engineer and MLOps Engineer, depending on the candidate’s skillset and experience.

While the exact nature and purpose of this work is confidential, it is an extremely exciting set of projects and objectives that touch HPC, ML, Simulation, and Quantum swim lanes. Insight Softmax Consulting’s platform team is building Computational Fluid Dynamics (CFD) services and architecture that will integrate directly with ML and Simulation services. You will interface and work with some of the most experienced and talented people on the planet across these swim lanes.

Key responsibilities:

  • The ideal candidate will have a strong background in building and maintaining high-performance computing services, expertise in distributed systems, and a passion for systems engineering, scaling, and data architecture. This role involves working closely with our development teams to design, implement, and optimize HPC and/or ML solutions that meet our growing needs.
  • Design, build, and maintain high-performance computing (HPC) infrastructure.
  • Develop and implement scalable distributed systems for complex computing tasks.
  • Scaling services and systems from smaller deployments of ~50 nodes into larger ~250+ node clusters.
  • Using orchestration, configuration management, virtualization, linux, CLI, deploy tools, monitoring, APM.
  • Monitor HPC systems performance and implement improvements to ensure scalability and efficiency.
  • Optimizing engineering and deliverables to balance feature development, product quality, service reliability.
  • Design, build, and maintain scalable and reliable machine learning and data science services.
  • Implement and manage data architectures, ensuring efficient data storage, processing, and retrieval mechanisms.
  • Develop and optimize batch scheduling systems to manage workflows and data pipelines efficiently.
  • Collaborate with data scientists, machine learning engineers, and software developers to integrate machine learning models into production environments seamlessly.
  • Stay up-to-date with the latest technologies and trends in machine learning infrastructure, data management, and distributed systems.

Required Skills:

  • At least 3-5 years of experience in HPC infrastructure, systems engineering, or a similar role.
  • Proven experience in machine learning infrastructure, data management, and architecture.
  • Strong understanding of batch scheduling, distributed systems, and systems engineering principles.
  • Experience with scaling and operating machine learning models and data processing pipelines in production environments (eg: DVC, MLflow).
  • HPC CFD experience.
  • Experience with containerization (Docker) and container orchestration tools (e.g., Kubernetes).
  • Strong understanding of systems engineering principles and scaling strategies.
  • Deep knowledge of at least one of AWS, GCP, or Azure. Preferably AWS.
  • Strong linux chops.
  • Experience with data architecture and large-scale data processing.
  • Building on-prem systems (on-premise, data center, rack-n-stack, etc).

Additional Information

  • Significant overlap with US CT time (-6 to UTC) is expected.
  • On-call rotations have not yet started in our business, but will likely begin sometime in 2024.
  • Travel for up to 4 weeks per year to customer destinations.

What we offer

  • Mix of serious projects and great working atmosphere, well recognized on market.
  • Dynamic international work environment.
  • Skilled and senior co-workers.
  • Very good financial compensation.
  • Private medical care.
  • Personal and professional development – internal Tech talks and soft skills trainings.

About Insight Softmax Consulting:

When an organization needs Data Science, Machine Learning, and AI experts, Insight Softmax is the team to call. We don’t just use Data Science tools, we build them.

Our engineers are at the forefront of coding innovation and development, designing Data Science tools from the ground up. Our skilled engineers can tackle any challenge from creating GPU compilers to improving libraries for gradient boosted trees. We work with businesses in industries like technology, retail, finance, energy, automotive, gaming and more.

Join our team of world-class data scientists and make a real-world, tangible impact on our clients’ most challenging problems.

WHO WE ARE?

Bakson Ltd is a software development company based in Belgrade. We are working with teams around the world and take pride on variety of projects we handle and technology we use.

HOW DO WE WORK?

Our workflow is inspired by Agile and Lean principles. We’re not devoted to Scrum or any other framework, but are trying to work in small batches, with fast feedback and very close interaction with product owners.

The emphasis in our team is on collaboration and mutual support – sharing project workflow with globally distributed teams, contributing code to core global services and applications, and encouraging cultural exchange between development groups. Bakson encourages working from home, and the distributed nature of our teams requires us to have flexibility around working hours. We’re familiar with asynchronous and remote work. A Software Engineer in our company is a core writer of code, but also an inspirer and an exemplar to other developers…

Basically, what we care about is that you are a self-starter, happy to work with others, and prepared to adapt and do your best.

HOW TO APPLY?

We aim for our hiring process to be as collaborative and realistic as possible, so it’ll be focused on writing and reviewing code – both written by you and by others. We want you to feel like you’d be comfortable working with us, and we also want to feel the same way, so you’ll meet quite a few of the team, and interact with them in as close to a life-like way as possible. This is a two-way street – we’re keen for you to like us as much as the other way around. If you’d like get started, you can apply by pressing the “apply” button on this webpage or by sending a CV or an introductory email to [email protected]

This website uses cookies for a better browsing experience, as explained at cookie policy. We are using cookies necessary for website functionality and analytics cookies:

OK