<SoftwareEngineer>
Gilbert Kipkorir

Build → Ship → Scale → Relax.

I design the backend, automate the ops, and make outages nervous.

☕ Java Expert • ☸️ Kubernetes Native • ☁️ Cloud Architect

</SoftwareEngineer>

About Me

Building Reliable Systems & Empowering Teams

I'm Gilbert Kipkorir (Kipkorir Cheruiyot) — a Senior Software Engineer, Site Reliability Engineer, and Platform Engineer with a strong background in backend development, cloud-native infrastructure, and DevOps. With over 7 years of hands-on experience, I've architected and maintained scalable, resilient systems for fintech, telecom, and startup environments.

My expertise spans Site Reliability Engineering, Platform Engineering, backend development with Java and Spring Boot, automation, CI/CD, Kubernetes orchestration, cloud platforms (AWS/GCP), and high-performance API design. I have a strong track record in compliance-aware fintech system design — including PCI-DSS, API security, and data governance — ensuring production systems meet both engineering and regulatory standards.

As co-founder of Francium Sources, I apply an innovator's mindset to complex platform challenges — shipping practical, production-ready solutions that balance reliability, developer velocity, and business compliance.

Java • Rust • Node.js • TypeScript • Kubernetes • Docker • AWS • GCP • Terraform • Prometheus • Grafana • CI/CD • SRE • Platform Engineering • DevOps • Backend • Fintech Compliance • Innovator
50+ Projects Completed
15+ Teams Collaborated
7+ Years Experience
20+ ML Models Deployed

Technical Skills

A concise, practical toolkit fortified by 7+ years delivering backend systems, SRE practices and cloud-native infrastructure — focused on reliability, scalability and operational excellence.

Languages & Frameworks

Core programming languages and frameworks for building scalable systems

Java: 7 years

Projects: Fintech APIs, Payment Systems, Microservices

Spring Boot: 6 years

Projects: REST/GraphQL APIs, Enterprise Services

Node.js & TypeScript: 5 years

Projects: Cloud-Native Apps, Real-time Services

Python: 6 years

Projects: Automation, Data Processing, ML Pipelines

Rust: 2 years

Projects: CLI Tools, High-performance Services

Go: 3 years

Projects: Cloud Services, CLI Tools

Cloud & DevOps

Cloud platforms and DevOps practices for reliable infrastructure

AWS (EC2, ECS, Lambda, RDS, S3): 6 years

Projects: Multi-region deployments, Serverless

Google Cloud Platform: 4 years

Projects: GKE, Cloud Run, BigQuery

Kubernetes (K8s): 5 years

Projects: Orchestration, Auto-scaling, Multi-tenant

Docker & Containerization: 6 years

Projects: Microservices, CI/CD Pipelines

Terraform: 5 years

Projects: Infrastructure as Code, Multi-cloud

Infrastructure & Automation

Infrastructure provisioning, configuration management, and automation

Infrastructure as Code (Terraform, CloudFormation): 5 years

Projects: Cloud migrations, Disaster recovery

Configuration Management (Ansible): 4 years

Projects: Server automation, Deployments

Scripting (Bash, Python): 7 years

Projects: Automation, Data processing

Linux System Administration: 7 years

Projects: Production servers, Performance tuning

Networking & Security: 6 years

Projects: VPCs, Load balancers, Firewalls

Testing, CI/CD & Observability

Continuous integration, delivery, and comprehensive monitoring

CI/CD (GitHub Actions, Jenkins, GitLab CI): 6 years

Projects: Automated pipelines, Blue-green deploys

Prometheus & Grafana: 5 years

Projects: Metrics, Dashboards, Alerting

ELK Stack (Elasticsearch, Logstash, Kibana): 4 years

Projects: Log aggregation, Analysis

Distributed Tracing (Jaeger, Zipkin): 3 years

Projects: Performance debugging

Unit & Integration Testing: 7 years

Projects: JUnit, Jest, PyTest

Interested in a deeper case study? Check the Projects section for detailed project writeups and the Insights/Blog for SRE & architecture lessons.

Insights & Technical Writing

Deep dives into SRE practices, distributed systems, backend engineering, and lessons learned from building and operating production systems at scale.

SRE

Building Resilient Microservices: Lessons from Production

Key patterns and anti-patterns learned while scaling microservices in a high-traffic fintech environment—from circuit breakers to graceful degradation.

Microservices • Resilience • Kubernetes
Jan 15, 2026 • 8 min read
DevOps

Observability at Scale: Prometheus & Grafana Deep Dive

How we implemented comprehensive observability for 100+ microservices, reducing MTTR by 70% and enabling proactive incident prevention.

Prometheus • Grafana • Monitoring
Jan 8, 2026 • 10 min read
Kubernetes

Zero-Downtime Deployments with Kubernetes

A practical guide to implementing blue-green and canary deployments in production, with real-world examples and gotchas to avoid.

Kubernetes • CI/CD • DevOps
Dec 20, 2025 • 12 min read
SRE

SLOs That Actually Work: From Theory to Practice

Moving beyond vanity metrics—how to define, measure, and act on Service Level Objectives that align with business outcomes.

SLO • SLI • Reliability
Dec 10, 2025 • 7 min read
Backend

Java Performance Tuning for High-Throughput APIs

Profiling, optimization techniques, and JVM tuning strategies that helped us achieve <50ms p99 latency at 10K+ requests/second.

Java • Performance • Spring Boot
Nov 28, 2025 • 15 min read
SRE

Incident Response Playbook: A Post-Mortem Culture

Building a blameless culture around incidents—structured playbooks, effective post-mortems, and turning failures into learning opportunities.

Incident Response • Culture • Post-Mortems
Nov 15, 2025 • 9 min read

Featured Projects

Selected backend and SRE projects with measurable outcomes. Client-sensitive implementations are labeled clearly and available as private case studies on request.

Cloud-Native API Platform

Private/NDA

Designed and deployed a scalable REST/GraphQL API platform on Kubernetes, supporting millions of requests per day for fintech and telecom clients.

Architecture and implementation details available on request due to client confidentiality.

Key Metrics

99.99% Uptime • 1M+ req/day • <100ms Latency

Technologies

Node.js • TypeScript • Kubernetes • Docker • AWS • Prometheus
Private code • Request case study
CI/CD Automation for Fintech

Private/NDA

Automated build, test, and deployment pipelines for a fintech product, reducing release time by 80% and enabling rapid iteration.

Pipeline design, rollout strategy, and before/after delivery metrics documented as a private case study.

Key Metrics

80% Faster Releases • 100+ Deployments/month

Technologies

Jenkins • GitHub Actions • Docker • Terraform • AWS
Private code • Request case study
Real-Time Monitoring & Alerting

Private/NDA

Implemented Prometheus/Grafana-based monitoring and alerting for telecom infrastructure, ensuring rapid incident detection and resolution.

Runbooks, alerting strategy, and incident outcome summaries available for interview walkthrough.

Key Metrics

99.9% Incident Detection • 10K+ Metrics Tracked

Technologies

Prometheus • Grafana • Kubernetes • AWS
Private code • Request case study
Infrastructure as Code Migration

Private/NDA

Migrated legacy infrastructure to Terraform-managed cloud resources, improving reliability, scalability, and disaster recovery.

Terraform module patterns and migration checklist can be shared as anonymized snippets.

Key Metrics

100% Infra as Code • 50+ Resources Automated

Technologies

Terraform • AWS • GCP • Docker
Private code • Request case study
High-Availability Payment Backend

Private/NDA

Architected a payment backend with automated failover and disaster recovery, ensuring zero downtime and high transaction throughput.

Operational architecture and resilience trade-offs are available as a private technical deep-dive.

Key Metrics

0 Downtime • 10K+ Transactions/hour

Technologies

Java • Spring Boot • PostgreSQL • Kubernetes • AWS
Private code • Request case study

Work Experience

Senior SRE & Backend Engineer

Cashia
Remote 2025 - Present

Architected and maintained backend services for Cashia, a fast-growing fintech startup. Focused on reliability, scalability, compliance, and developer experience.

  • Designed and deployed microservices for payment processing and fintech compliance workflows
  • Automated infrastructure provisioning and scaling via Platform Engineering practices
  • Enhanced API performance, security, and PCI-DSS-aligned data handling

Senior SRE & Backend Engineer

Safaricom PLC
Nairobi, Kenya 2021 - 2025

Led reliability and platform engineering for mission-critical telecom services at Safaricom. Improved system uptime, automated incident response, and mentored SRE teams across compliance-sensitive environments.

  • Reduced downtime by 40% through proactive monitoring and automation
  • Implemented scalable logging, alerting, and compliance audit trail systems
  • Drove adoption of Kubernetes, cloud-native tools, and Platform Engineering standards

Mobile Developer - Android

Freelance
Nairobi, Kenya 2019 - 2021

Developed and maintained Android applications for various clients, focusing on reliability, performance, and seamless integration with backend APIs and cloud services. Collaborated with cross-functional teams to deliver user-centric mobile solutions.

  • Published multiple apps to the Google Play Store
  • Built a reusable library for notifications, dialogs, and timely patches
  • Integrated mobile apps with scalable backend APIs and cloud services

Co-Founder

Francium Sources
Nairobi, Kenya 2020 - Present

Co-founded Francium Sources as an innovator and engineer, leading the development of scalable fintech solutions and cloud-native platforms. Oversaw product strategy, platform engineering, and SRE initiatives across compliance-conscious markets.

  • Launched a multi-tenant SaaS platform serving 50+ clients with compliance-ready infrastructure
  • Built a Kubernetes-based platform engineering foundation for high availability
  • Established SRE practices, automated CI/CD pipelines, and compliance monitoring

Education

BSc in Computer Science

Dedan Kimathi University
Nyeri, Kenya 2015 - 2018

Focused on software engineering, backend development, and distributed systems.

MSc in Artificial Intelligence

Open University of Kenya
Online 2025 - Present

Specialized in AI, machine learning, and cloud-native architectures.

Let's Work Together

I'm available for Senior Backend, SRE, and DevOps roles or consulting. Reach out via phone, email or the form below and I'll get back to you quickly.

Get In Touch

Location

Westlands, KE
