Senior Site Reliability Engineer - BeOne
Join us as a Senior Site Reliability Engineer, where your expertise will directly shape innovative infrastructure solutions and reliability best practices across our entire organization.
Senior Site Reliability Engineer (GCP, Kubernetes)
About the role:
Take charge of ensuring our data-intensive infrastructure is robust, secure, scalable, and optimized for exceptional performance, delivering the best experience for our customers.
As a Senior Site Reliability Engineer, you'll have a direct and influential role in shaping our organization's reliability strategy and infrastructure. You'll proactively create robust solutions, implement best practices, and drive infrastructure excellence across all teams.
Join us remotely: our work is 100% online, so you can be located anywhere in or near the CET time zone. The position is full-time.
About us:
BeOne is a next-generation neobank that redefines how individuals and businesses manage money by blending traditional and digital finance. Our platform offers multi-currency accounts, ultra-low fees, real-time global payments, and robust financial tools, all within an intuitive, refined interface.
Our bold vision is to become the largest regulated funds and data transfer network for both retail and business customers. We empower users with financial freedom, security, and efficiency, whether for personal finances, business operations, or global investments.
In this role, you will:
- Define and lead the vision and strategy for Site Reliability Engineering (SRE), ensuring alignment with both business goals and engineering priorities.
- Architect, develop, and maintain infrastructure on Google Cloud Platform (GCP) and Google Kubernetes Engine (GKE), focusing on high-performance design that prioritizes security, availability, and reliability.
- Design and implement automated solutions for system reliability, capacity planning, and incident response to reduce manual tasks and enhance operational efficiency.
- Collaborate closely with engineering and product teams to build highly available, scalable, and fault-tolerant systems.
- Establish and manage Service Level Objectives (SLOs), Service Level Indicators (SLIs), and error budgets to continuously improve system reliability.
- Document implemented solutions clearly and comprehensively.
- Mentor and guide engineering teams, promoting SRE principles, DevOps culture, and best practices throughout the organization.
- Stay informed on industry trends, proactively integrating new tools, technologies, and methodologies to consistently enhance reliability and operational excellence.
- Maintain a strategic balance between robust security practices and a flexible, collaborative working environment.
- Actively participate in daily stand-ups and planning meetings.
What we expect from you:
- 5+ years in a DevOps, SRE, or similar role in the FinTech business domain
- Strong experience in managing platforms autonomously, with a focus on risk assessment and decision-making
- Proficiency in at least one programming language: Python, GoLang, C++, or Java.
- Strong Linux administration skills (Debian/Ubuntu)
- Solid grasp of LAN/WAN networking, firewalls, proxy servers, load balancers, and protocols (HTTP(S), DNS, SSH, TCP/IP, REST)
- Hands-on experience with Docker containerization
- Familiarity with CI/CD systems and version control
- Expertise in Kubernetes and Helm
- Experience with public cloud platforms (GCP, AWS, or Azure).
- Proven ability to implement redundancy and disaster recovery scenarios
- Track record in scaling high-efficiency production systems
- Proficiency with observability tools (e.g., Prometheus, Grafana, Grafana Mimir, OpenTelemetry)
- Strong written and spoken English (B2 level or higher)
Nice to Have:
- Experience with Argo CD and Argo Rollouts.
- Familiarity with technologies such as Kafka, Redis, Nginx, Apache HTTP Server, OpenVPN, and NATS.
- Knowledge of logging tools (Kibana, Fluentd, Elasticsearch).
- Expertise in configuring, managing, and optimizing large PostgreSQL databases.
- Understanding of SSO and Okta technologies.
- Self-motivated, accountable, and capable of working independently.
- An interest in finance, trading, and crypto.
Why it’s worth a try - advantages of working at ICEO:
- Remote-first company - we enable you to work from anywhere in the world.
- Flexible working hours - we understand the challenges of juggling personal and professional life. That is why our core working hours are 11 am to 3 pm CET; outside those hours, you choose when you work.
- 38 days PTO - you have 38 days of paid time off per year, so that you can recharge and relax.
- Learning & development - grow through internal and external learning and development programs.
- A modern technical stack with an emphasis on quality
Recruitment Process:
- Screening with Talent Acquisition Partner
- First interview with the Hiring Manager
- Technical Challenge Interview with DevOps Team
Want to know more?:
- Department: Technology
- Remote status: Fully Remote
- Employment type: Contract