This site uses cookies. To find out more, see our Cookies Policy

Senior Manager, Enterprise Infrastructure & Site Reliability in Plantation, FL at TradeStation

Date Posted: 5/16/2019

Job Snapshot

Job Description


Senior Manager, Enterprise Infrastructure & Site Reliability
Plantation, FL
 
TradeStation’s Operations team relies on its Engineering teams to support our enterprise network and systems. The Senior Manager plays a key role in our operational efforts. The person will be in charge of driving the team(s) that has full stack ownership of our enterprise network and systems. This position requires technical knowledge, ability to communicate properly with the team and at the same time, focus on growing the System and Network team(s) both by recruiting and mentorship. This person will be in charge of transforming the team from a traditional IT team to more of a software development team with production support responsibilities.

 
ESSENTIAL JOB FUNCTIONS:
 
You will be responsible for managing TradeStation’s enterprise System and Network Engineering teams. You will own the execution of technical roadmap for the team(s) in our environment. As the company continues to grow, you will mentor and develop the team as we scale. You will emphasize efficiency, automation, scalability and high-availability, and the CORE principles of DevOps.
 
You must be able to lead the team not only to achieve the uptime established by SLAs but also ensure this is achieved with the maximum level of automation and efficiency.
 
Responsibilities:
  • Full stack ownership of up to 10-12 System and Network Engineers
  • Create automation mindset environment.
  • Transform team to automation engineers 
  • Full stack ownership of production support and be available for rotating 24x7 on call support
  • Transform the team to be DevOps oriented; work to integrate the data center team to the different product development teams
  • Work with vendor representatives and other technology operations teams to coordinate, escalate, troubleshoot, and resolve service interruptions as expeditiously as possible
  • Manage a large enterprise operation, ensuring service levels are met and adverse impacts are kept to a minimum
  • Implement automation solutions and help lead implementation of new innovations
  • Participate in disaster recovery and business continuity programs
  • Set performance targets and thresholds and define key success criteria
  • Report and analyze systems dashboards with recommendations to Senior/Executive Management
  • Develop a quality improvement strategy to achieve objectives
    • Drive toward 100% automated test deployments and self-service tools
    • Facilitate product specific operations strategy and deployment management plans
    • Assure the viability, functionality and effectiveness of essential systems and monitoring tools




People Management:

  • Help the Engineers develop their careers, assigning them to projects tailored to their skill levels, long-term skill development, personalities, and work styles
  • Inspire and mobilize teams to perform and deliver their best
  • Build and maintain ongoing relationships with peers and other departments
  • Manage and ensure timely response to support issues
  • Provide effective communication regarding issues, objectives, initiatives and performance to teams, peers and management
  • Forecast resource requirements to manage the business



Program Management:

  • Develop integrated quality plan for the Engineers and ensure successful execution
  • Manage project interdependencies and risks to ensure project efforts remain in synchronization
  • Ensure timely retrospection, performance analysis and benefit realization for each project
  • Report SLA/KPI dashboards with analysis and recommendations to Senior/Executive Management
  • Identify, manage and communicate risks defined by systems to management and take corrective action, escalating as needed, to resolve and achieve commitments

KNOWLEDGE, SKILLS & ABILITIES:
  • Demonstrated leadership and communications skills
  • Experience working in development team(s) that have delivered commercial software or software-based services
  • Knowledge of change management controls/processes
  • Ability to effectively present information and respond to questions from other departments and from all levels of the organization
  • Must have technical knowledge to understand the environment and provide management updates when needed
  • Experience with managing hundreds to thousands of Windows machines
  • Understanding of complex networks
  • Knowledge of cloud technologies with ability to implement solution and support them

EDUCATION & EXPERIENCE
  • BS degree in Computer Science, Mathematics, or Business.
  • AWS or AZURE cloud experience
  • Must have a development background
  • 8 years of experience in an organization with a large, high availability, 24x7, mission critical operations environment supporting enterprise systems
  • 4 years of experience in a leadership/management role with several direct reports