Are you interested in building the next generation of Internet services that touch hundreds millions of users across the globe every day?
This is an excellent opportunity to join one of Rakuten’s world large scale distributed fault tolerant systems of Site Reliability Engineering (SRE) work with One cloud. Rakuten’s development unit is the core of our entire group that drives our business today. You will be joining a diverse global team and play a central role in our technology innovations.
In the Cloud Platform Department (CPD), our mission is to build and maintain robust infrastructure and platform solutions that enable and empower Rakuten’s businesses around the world.
The Site Reliability Engineering (SRE) team is looking for people who are passionate to work in a global scale with experienced Software Development Engineers. To build out automation of critical service reliability and efficiency functions that ensures massively scaled, fault-tolerant and globally distributed service for our end users.
You can get experiences which you can design system architecture and build platform considering not only application layer but also hardware, network and middleware by collaborating various services having various service level and service platform.
Position: SRE Engineer
BS degree in Computer Science or related technical field involving coding and / or systems engineering
6-9 yrs of Experience.
Improvement of service’s performance and latency
Standardization and Automation for reducing Toils in service management
Providing standard monitoring systenms for stable operation of service
Providing standard design for expanding new service easily
Trouble Shooting of service in the trouble
Proficiency in one or more of the following: Go, Python, C, C++, Java, Rust,Perl, Ruby or shell scripting
Experience with algorithms, data structures and software design
Experience with UNIX operating systems internals and / or networking
Experience with CI/CD tools (ex. Jenkins, CircleCI)
Experience in automation and configuration management using Chef, Ansible, Terraform,
Experience with monitoring?troubleshooting
Experience with cloud platform (ex. OpenStack)
Experience with container solution and tools (ex. LXC/Docker/Kubernetes/Mesos)
Experience with and working knowledge of IP networking systems and protocols
Web application development experience
Scrum Master, Project Manager Experience
Software Engineer for Test Experience
Experience in service operations with Docker, Kubernetes
Experience Over 3 years of experience managing one of the following products: MySQL, Cassandra, Oracle, Couchbase, Hadoop or kafka”