PayPal is seeking a skilled Service Mesh Engineer to contribute to the design, implementation, and maintenance of the service mesh infrastructure across its distributed systems, ensuring reliability, security, and observability.
Requirements
- Hands-on experience with Istio, Linkerd, Consul Connect, or AWS App Mesh; Envoy Proxy management; strong grasp of service discovery, load balancing, circuit breaking, retries, and fault injection.
- Proficient in Go or Python for automation/CLI tool development; knowledgeable in concurrent programming, networking libraries, API integrations, and microservices.
- Practical knowledge of Kubernetes (CRDs, operators, networking), Docker, Helm, and GitOps.
- Deep understanding of TCP/IP, HTTP/2, gRPC, TLS/mTLS, network policies, microsegmentation, and zero-trust security.
- Experience with distributed tracing, metrics (Prometheus/Grafana), log aggregation, and SLI/SLO monitoring.
- Advanced bash scripting, awk/sed usage, packet analysis (tcpdump/Wireshark), and system troubleshooting.
- Experience with Infrastructure as Code (Terraform, Pulumi) and CI/CD pipeline integration.
Responsibilities
- Delivers complete solutions spanning all phases of the Software Development Lifecycle (SDLC), including design, implementation, testing, delivery, and operations, based on definitions from more senior roles.
- Advises immediate management on project-level issues.
- Guides junior engineers.
- Operates with little day-to-day supervision, making technical decisions based on knowledge of internal conventions and industry best practices.
- Applies knowledge of technical best practices in making decisions.
- Designs and implements a service mesh to manage service-to-service communication, service discovery, security, and reliability across both on-premises and cloud environments.
- Works collaboratively with teams across the Infrastructure and Cloud Platforms organization to ensure that the service mesh delivers on commitments to performance, availability, and maintainability.
Other
- Minimum of 5 years of relevant work experience and a Bachelor's degree or equivalent experience.
- Strong problem-solving and debugging abilities.
- Proficient at analyzing distributed system interactions and tracing request flows.
- Skilled in incident response, documenting root causes, and proactive prevention.
- Ability to profile system performance, debug network/service mesh issues, and communicate effectively for cross-team collaboration.