Back to Jobs
Oracle

Senior Network Reliability Developer at Oracle

Oracle Remote - Abilene, TX

Job Description

DescriptionWe are seeking a skilled and proactive engineer with 3-5 years of experience to join our Network Reliability (NRE) team. The NRE team is our front-line for addressing physical network issues and operates 24x7x365 to ensure the reliability and efficiency of our physical network infrastructure. The team is responsible for performing data collection triage technical analysis incident mitigation and redirection as necessary to maintain and optimize operations.Our primary objectives are:Ensure maximum possible service availability and performanceDeliver premier customer serviceStructured cabling knowledge with a highly critical eye for quality installationDatacenter construction knowledgeProficiency in using OTDR and other cable troubleshooting equipment along with the capability to instruct and train engineers on their proper usage.Provide comprehensive support to our Engineering and other Operational and technical teamsThese objectives translate into a broad and dynamic scope of responsibilities for the GNOC team. Engineers will have the capability to centrally manage OCIs networks and implement automated solutions to address common operational challenges efficiently.Note:This is a 24/7/365 remote position and includes day/night shift weekend and holiday work with occasional travel to Abilene TX based on business requirements. This position will also require consent to the processing of biometric data for identity verification and access control.Career Level - IC3ResponsibilitiesWork with Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration technical dependencies and overall behavioral characteristics of production services. Responsible for the design and delivery of the mission critical stack with focus on security resiliency scale and performance. Authority for end-to-end performance and operability. Partner with development teams in defining and implementing improvements in service architecture. Articulate technical characteristics of services and technology areas and guide Development Teams to engineer and add premier capabilities to the Oracle Cloud service portfolio. Understand and communicate the scale capacity security performance attributes and requirements of the service and technology stack. Demonstrate clear understanding of automation and orchestration principles. Act as ultimate escalation point for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs). Utilize a deep understanding of service topology and their dependencies required to troubleshoot issues and define mitigations. Understand and explain the affect of product architecture decisions on distributed systems. Professional curiosity and a desire to a develop deep understanding of services and technologies.NRE OperationsUsing existing procedures and tooling develop and safely complete network changesMentor onboard and train junior engineersParticipate in operational rotations providing break-fix supportIdentifying actionable incidents using monitoring systems strong analytical problem-solving skills to mitigate network events/incidents and following up on routine root cause analysis (RCA) coordinating with support teams and vendorsProvide on-call support services as needed job duties are varied and complex needing independent judgmentJoin major event/incident calls use technical and analytical skills to resolve network issues that impact Oracle customers/servicesFault handling and escalation - Identifying and responding to faults on OCIs systems and networks collaborating closely with 3rd party suppliers handling escalation through to resolutionLeadership:Collaborates with the GNOC Shift Leads and management to ensure the efficient and timely completion of daily GNOC responsibilitiesTakes the initiative to lead contribute to and participate in the identification development and evaluation of projects and tools aimed at enhancing the overall effectiveness of OCI.Develops and drives runbook audits and updates to ensure compliance and collaborate with partner service teams to ensure operational processes are aligned between teamsConduct interviews and participate in the hiring process for junior level engineersLead and/or represent the GNOC in vendor meetings reviews or governance boardsAutomations/ScriptingThe role includes collaborating with networking automation services to integrate support tooling and frequently developing scripts to automate routine tasksPreference for individuals with experience in scripting and network automation - Python Puppet SQL and/or AnsibleYou will use automation to complete work and develop scripts for routine tasksProject ManagementLead technical projects such as the development and improvement of runbooks and methods of procedures driving high visibility technical projects and onboarding and training new team membersAssists in the implementation of short medium and long-term plans to achieve project objectives and regularly interacts with senior management or network leadership to ensure team objectives are metTECHNICAL QUALIFICATIONS:NetworkingAdvanced knowledge in the following protocols: PBGP/OSPF/IS-IS TCP IPv4 IPv6 DNS DHCP MPLSAdvanced knowledge in the following networking protocols: TCP/IP VPN DNS DHCP and SSLComprehensive/Broad experience in at least 3 of the following network technologies: Juniper Cisco Arista InfiniBand firewalls switches and circuit managementAdvanced analytical skills and ability to collate and interpret data from various sourcesAbility to diagnose network alerts to assess and prioritize faults and respond or escalate accordinglyExperience working in a large ISP or cloud provider environmentExposure to commodity Ethernet hardware (Broadcom/Mellanox)Cisco and Juniper certifications are desiredGPU/RDMAExperience in GPU/RDMA network environments is highly desiredExperience with High Performance ComputeExperience with InfiniBandDesignParticipate in Network lifecycle management through network build and/or upgrade projectsParticipate in network solution design reviewSOFT SKILLS & OTHER DESIRED EXPERIENCE:Highly motivated and self-starterBachelors degree is preferred with at least 3-5 years of network-related experienceStrong oral and written communication skillsExcellent time management and organization skills#LI-KR4QualificationsDisclaimer:Certain US customer or client-facing roles may be required to comply with applicable requirements such as immunization and occupational health mandates.Range and benefit information provided in this posting are specific to the stated locations onlyUS: Hiring Range in USD from: $74900 to $158200 per annum. May be eligible for bonus and equity.Oracle maintains broad salary ranges for its roles in order to account for variations in knowledge skills experience market conditions and locations as well as reflect Oracles differing products industries and lines of business.Candidates are typically placed into the range based on the preceding factors as well as internal peer equity.Oracle US offers a comprehensive benefits package which includes the following:1. Medical dental and vision insurance including expert medical opinion2. Short term disability and long term disability3. Life insurance and AD&D4. Supplemental life insurance (Employee/Spouse/Child)5. Health care and dependent care Flexible Spending Accounts6. Pre-tax commuter and parking benefits7. 401(k) Savings and Investment Plan with company match8. Paid time off: Flexible Vacation is provided to all eligible employees assigned to a salaried (non-overtime eligible) position. Accrued Vacation is provided to all other employees eligible for vacation benefits. For employees working at least 35 hours per week the vacation accrual rate is 13 days annually for the first three years of employment and 18 days annually for subsequent years of employment. Vacation accrual is prorated for employees working between 20 and 34 hours per week. Employees working fewer than 20 hours per week are not eligible for vacation.9. 11 paid holidays10. Paid sick leave: 72 hours of paid sick leave upon date of hire. Refreshes each calendar year. Unused balance will carry over each year up to a maximum cap of 112 hours.11. Paid parental leave12. Adoption assistance13. Employee Stock Purchase Plan14. Financial planning and group legal15. Voluntary benefits including auto homeowner and pet insuranceThe role will generally accept applications for at least three calendar days from the posting date or as long as the job remains posted.Career Level - IC3Required Experience:Senior IC Key Skills Kubernetes,FMEA,Continuous Improvement,Elasticsearch,Go,Root cause Analysis,Maximo,CMMS,Maintenance,Mechanical Engineering,Manufacturing,Troubleshooting Employment Type : Full-Time Experience: years Vacancy: 1 Yearly Salary Salary: 74900 - 158200

Resume Suggestions

Highlight relevant experience and skills that match the job requirements to demonstrate your qualifications.

Quantify your achievements with specific metrics and results whenever possible to show impact.

Emphasize your proficiency in relevant technologies and tools mentioned in the job description.

Showcase your communication and collaboration skills through examples of successful projects and teamwork.

Explore More Opportunities