Performance and Tools Monitoring Management Job in New York 10022, New York US
DESCRIPTION:
The incumbent will be part of a newly formed performance management team, providing performance- and capacity-related assistance to various application and infrastructure teams globally. The individual will be responsible for building the initial practice: analyzing the current capabilities and, based on that analysis, building and implementing performance and capacity management capabilities to suit the organization’s needs. The individual will also be expected to evaluate, recommend, and prototype new solutions for day-to-day performance and capacity across networks, systems, storage, databases, and applications. A key job responsibility will be to focus on operational excellence (reducing the number of performance- and capacity-related incidents, improving service levels on requests, etc.). The ideal candidate will have a good understanding of technology as well as a strong grasp of process improvement (leveraging ITIL, MOF, etc.).
RESPONSIBILITIES:
· Own the following performance / capacity / monitoring tools and functions: Nagios, Corvil, OPNET/ITRS, configuration management, capacity management, log monitoring, WhatsUp Gold, NetFlow.
· Develop and grow the usage of these tools across all systems and network devices.
· Provide ongoing reporting to senior management, application owners, and product managers.
· Identify emerging performance and capacity problems before they cause incidents.
· Triage performance problems across all systems and network devices prior to escalating to Core Infrastructure teams.
· Identify products / tools / solutions to continue to improve performance management capabilities.
· Identify synergies between application and systems monitoring tools.
· Expand the coverage of the performance management practice to include storage, databases, etc.
· Continually fine-tune the performance / monitoring tools to cover components that are critical but not yet monitored.
· Define, lead, and participate in a capacity management program for key systems in conjunction with application development teams.
· Create and maintain a dashboard to highlight key performance metrics.
· Work closely with Core Infrastructure teams to remediate performance / capacity issues.
· Participate in key ITIL processes such as problem management and incident management.
SPECIFIC RESPONSIBILITIES: