The Infrastructure Management Imperative Smart Grid Infrastructure Management Challenges and Solutions
Which is most important Infrastructure, applications or the user experience? They all are. There is no end-user experience without a quality application running on a solid infrastructure.
An important distinction This topic is about managing the IT and Communications systems and intelligent devices that make Smart Grid business processes work It is not about controlling electrons or managing the customers that consume them.
Changes, Challenges and Problems
SmartGrid : What s Changing Scale, scope and complexity Ubiquitous Intelligent devices New protocols Blurring line between OT and IT Unprecedented volumes of Data
Integration Middleware What needs to be managed? OMS/DMS/SCADA RF (Mesh Network) Backhaul WAN (e.g. Cellular, WI-MAX, BPL etc.) MDMS Meter Data Warehouse Cell Relay Intelligent Grid Devices Head-end Collection Engine Load Control System Collections CIS / Billing Customer Portal
What are the challenges? Challenge Fault Management Description Remedy non responding devices Config. and Firmware Mgt Coordinate change/configuration management Performance Mgt Security Mgt Business Service Mgt. Get early warning of device/applications/systems degradation Protect against security threats Plus Achieve SLAs and KPIs Investment Protection Design and build future proof solutions.
What are the problems? Silos. No view of end to end business processes Tools and systems scalability Lack of ability to drill down Inadequate network event correlation Lack of simulation tools Limited visibility to business service impact Security
Bridging ICT and OT Business Processes ICT* Infrastructure Management Grid Device Infrastructure Management FCAPS/eTom & ITIL Models Real-time Situational Awareness ICT OT *ICT=Information & Communications Technology Business Impact
Success Strategies Using tools and best practices from ITIL and Business Service Management
Working smarter, not harder Applications Wasted time due to multiple people working on the same issue due to poor processes and organizational maturity Level 2 duplicates the level 1 response Other operations will identify and escalate if they cannot solve the problem Level 1 is already working on fix Infrastructure Application support Server support Network support Business Process Monitors Transaction Monitors User Experience Monitors App. Internals App. Monitor Native Server Tools Server Monitors Storage Monitors Network Monitors 3 rd party and open source User issue DNS server issue Connectivity issue
Another example Applications Events are raised in the application support tool Often the application is not the problem Who do we give it to now? SME s cannot see data from different sources necessary to properly isolate performance problems Events are raised in the SME s tool of choice No overall view of what s happening Event storms result in multiple teams working on the same problem No overall repeatable process control SME s like to solve the easy events, they get to be the guru Infrastructure Operations 1 Operations 2 Operations 3 Operations cannot add value to the events Pass events straight to subject matter experts Frequently not sure who to pass the event on to, resulting in allocation ping-pong
The Power of Business Service Management Application focused view Application and infrastructure subject matter experts Business availability Business transaction management Business service level management End-user experience Diagnostics and guided problem isolation Understand Business Service Status Consolidated infrastructure operations Service impact analysis Cross-domain correlation and root cause Consolidated event and performance Infrastructure lifecycle management Analyse and prioritize potential impact Infrastructure focused view Operations staff Event Service level management Discovery services One version of the truth UCMDB Business Service Model Integration with key ITSM processes Incident Problem Change Configuration Release
Incident Mgt Exposure Delivering holistic service assurance Resource and User view of service Service Management Service Model Topology SLA Mgt Service Health PM Service Quality Mgt FM KPIs & KQIs Alarms User Management User Experience Assurance Stakeholders: Outage management Billing, CRM Fraud EMS, DMS Marketing Resource Management NE Performance E/P - MS NE Faults & Events Signaling xdrs User Plane Meters concentrators Resource monitoring Probes IT - Cloud LAN substation WAN IHD monitored resource Grid/Electricity sensor/device Telecom device remote network
Business Service Mgt for Meter to Cash Monitor end-to-end processes, respond to bottlenecks, determine biz impact Developed with
Success Strategies Using tools and best practices from Real-time Grid Management
GEO-Visualization of an extensive AMI Incident
Drill down to Customer detail
GEOBULAR summary of AMI Incidents
Incident Investigation with Group Pinging
Correlation of AMI Incident with Grid Outage
Success Strategies Building a Next Generation Architecture for Smart Grid Service Assurance
Bridging ICT and OT Business Processes ICT* Infrastructure Management FCAPS/eTom & ITIL Models Fault Configuration Accounting Performance Security Business Service Mgt ICT Grid Device Infrastructure Management Real-time Situational Awareness Meter to Grid Connectivity Electrical state Capacity Geospatial Views NERC CIP compliance OT *ICT=Information & Communications Technology Business Impact
Conceptual Architecture Business Service Management Network Management FCAPS etom AMI/Grid Management EMS/ DMS IT Management ITIL S E N S E Unified Operational DB Service Assurance Applications Analytics Services Integration Services (SOA) Enabling Technologies Visualization Services Unified Object Model R E S P O N D Back Office Network Operations Center Smart Grid Network Smart Grid Devices Smart Customer Managed Objects
Thank You! Please drop by Alstom, HP and Infosys booths for further discussion and demonstrations jayant.s.kumar@alstom.com roy.pratt@hp.com william_lewis@infosys.com