Senior Site Reliability Engineer - BeOne

Senior Site Reliability Engineer (GCP, Kubernetes)

About the role:

Take charge of ensuring our data-intensive infrastructure is robust, secure, scalable, and optimized for exceptional performance, delivering the best possible experience for our customers.

As a Senior Site Reliability Engineer, you'll have a direct and influential role in shaping our organization's reliability strategy and infrastructure. You'll proactively create robust solutions, implement best practices, and drive infrastructure excellence across all teams.

Join us remotely: our work is 100% online, so you can be located anywhere close to the CET time zone. The position is full-time.

About us:

BeOne is a next-generation neobank that redefines how individuals and businesses manage money by blending traditional and digital finance. Our platform offers multi-currency accounts, ultra-low fees, real-time global payments, and robust financial tools, all within an intuitive, refined interface.

Our bold vision is to become the largest regulated funds and data transfer network for both retail and business customers. We empower users with financial freedom, security, and efficiency, whether for personal finances, business operations, or global investments.

In this role, you will:

Define and lead the vision and strategy for Site Reliability Engineering (SRE), ensuring alignment with both business goals and engineering priorities.
Architect, develop, and maintain infrastructure on Google Cloud Platform (GCP) and Google Kubernetes Engine (GKE), focusing on high-performance design that prioritizes security, availability, and reliability.
Design and implement automated solutions for system reliability, capacity planning, and incident response to reduce manual tasks and enhance operational efficiency.
Collaborate closely with engineering and product teams to build highly available, scalable, and fault-tolerant systems.
Establish and manage Service Level Objectives (SLOs), Service Level Indicators (SLIs), and error budgets to continuously improve system reliability.
Document implemented solutions clearly and comprehensively.
Mentor and guide engineering teams, promoting SRE principles, DevOps culture, and best practices throughout the organization.
Stay informed on industry trends, proactively integrating new tools, technologies, and methodologies to consistently enhance reliability and operational excellence.
Maintain a strategic balance between robust security practices and a flexible, collaborative working environment.
Actively participate in daily stand-ups and planning meetings.

What we expect from you:

5+ years in a DevOps, SRE, or similar role in the FinTech business domain
Strong experience in managing platforms autonomously, with a focus on risk assessment and decision-making
Proficiency in at least one programming language: Python, Go, C++, or Java
Strong Linux administration skills (Debian/Ubuntu)
Solid grasp of LAN/WAN networking, firewalls, proxy servers, load balancers, and protocols (HTTP(S), DNS, SSH, TCP/IP, REST)
Hands-on experience with Docker containerization
Familiarity with CI/CD systems and version control
Expertise in Kubernetes and Helm
Experience with public cloud platforms (GCP, AWS, or Azure)
Proven ability to implement redundancy and disaster recovery scenarios
Track record in scaling high-efficiency production systems
Proficiency with observability tools (e.g., Prometheus, Grafana, Grafana Mimir, OpenTelemetry)
Strong written and spoken English (B2 level or higher)

Nice to Have:

Experience with Argo CD and Argo Rollouts.
Familiarity with technologies such as Kafka, Redis, Nginx, Apache HTTP Server, OpenVPN, and NATS.
Knowledge of logging tools (Kibana, Fluentd, Elasticsearch).
Expertise in configuring, managing, and optimizing large PostgreSQL databases.
Understanding of SSO and Okta technologies.
Self-motivated, accountable, and capable of working independently.
An interest in finance, trading, and crypto.

Why it’s worth a try - advantages of working at ICEO:

Remote-first company - we enable you to work from anywhere in the world.
Flexible working hours - we understand the challenges of juggling personal and professional life. That is why our core working hours are between 11 am and 3 pm CET, and you choose when you work outside of those hours.
38 days PTO - you have 38 days of paid time off per year, so that you can recharge and relax.
Learning & development - opportunity to grow through internal and external learning & development programs.
A modern technical stack with an emphasis on quality

Recruitment Process:

Screening with Talent Acquisition Partner
First interview with the Hiring Manager
Technical Challenge Interview with DevOps Team

Want to know more?:

Take a look at our profile on Clutch and find out what our clients say about us.
Visit our website and check who we have helped to succeed.
