Troubleshooting Your SUSE TUT6113. Cloud. Paul Thompson SUSE Technical Consultant. Dirk Müller SUSE OpenStack Engineer

Similar documents
Trouble?! Shoot your Cloud!

SUSE OpenStack Cloud. Enabling your SoftwareDefined Data Center. SUSE Expert Days. Nyers Gábor Trainer &

Docker Networking In OpenStack What you need to know now. Fawad Khaliq

Cloud in a box. Fully automated installation of SUSE Openstack Cloud 5 on Dell VRTX. Lars Everbrand. Software Developer

Define Your Future with SUSE

Managing Linux Servers Comparing SUSE Manager and ZENworks Configuration Management

Provisioning with SUSE Enterprise Storage. Nyers Gábor Trainer &

Using Crowbar to Deploy Your OpenStack Cloud. Adam Spiers Vincent Untz John H Terpstra

Expert Days SUSE Enterprise Storage

SUSE Manager Roadmap OS Lifecycle Management from the Datacenter to the Cloud

Best practices with SUSE Linux Enterprise Server Starter System and extentions Ihno Krumreich

SUSE Manager in Large Scale 17220

Welcome to SUSE Expert Days 2017 Service Delivery with DevOps

How To Make Databases on SUSE Linux Enterprise Server Highly Available Mike Friesenegger

Build with SUSE Studio, Deploy with SUSE Linux Enterprise Point of Service and Manage with SUSE Manager Case Study

Novell SLES 10/Xen. Roadmap Presentation. Clyde R. Griffin Manager, Xen Virtualization Novell, Inc. cgriffin at novell.com.

Linux and z Systems in the Datacenter Berthold Gunreben

SUSE Linux Enterprise High Availability Extension

SUSE Linux Enterprise Kernel Back to the Future

SUSE Manager and Salt

Linux High Availability on IBM z Systems

SUSE An introduction...

Essentials. Johannes Meixner. about Disaster Recovery (abbreviated DR) with Relax-and-Recover (abbreviated ReaR)

Exploring History with Hawk

Using Linux Containers as a Virtualization Option

Introduction to Software Defined Infrastructure SUSE Linux Enterprise 15

Building a Secure and Compliant Cloud Infrastructure. Ben Goodman Principal Strategist, Identity, Compliance and Security Novell, Inc.

SaltStack and SUSE Systems and Configuration Management that Scales and is Easy to Extend

Saving Real Storage with xip2fs and DCSS. Ihno Krumreich Project Manager for SLES on System z

Build Cloud like Rackspace with OpenStack Ansible

SDS Heterogeneous OS Access. Technical Strategist

HPE Digital Learner OpenStack Content Pack

Protect your server with SELinux on SUSE Linux Enterprise Server 11 SP Sander van Vugt

SUSE OpenStack Cloud Production Deployment Architecture. Guide. Solution Guide Cloud Computing.

BOV89296 SUSE Best Practices Sharing Expertise, Experience and Knowledge. Christoph Wickert Technical Writer SUSE /

Too Many Metas A high level look at building a metadata desktop. Joe Shaw

DEEP DIVE: OPENSTACK COMPUTE

DevOps with SUSE: How SUSE Manager, SUSE Studio and SUSE Cloud APIs Facilitate Continuous Software Delivery. Wolfgang Engel.

HPE HELION CLOUDSYSTEM 9.0. Copyright 2015 Hewlett Packard Enterprise Development LP

Secure Authentication

openqa Avoiding Disasters of Biblical Proportions Marita Werner QA Project Manager

Minimal OpenStack Starting Your OpenStack Journey

Exploring the High Availability Storage Infrastructure. Tutorial 323 Brainshare Jo De Baer Technology Specialist Novell -

Open Enterprise & Open Community

SUSE Cloud Admin Appliance Walk Through. You may download the SUSE Cloud Admin Appliance the following ways.

Novell Infiniband and XEN

openqa Avoiding Disasters of Biblical Proportions Marita Werner QA Project Manager

High Availability for Enterprise Clouds: Oracle Solaris Cluster and OpenStack

클라우드스토리지구축을 위한 ceph 설치및설정

Redhat OpenStack 5.0 and PLUMgrid OpenStack Networking Suite 2.0 Installation Hands-on lab guide

THE CEPH POWER SHOW. Episode 2 : The Jewel Story. Daniel Messer Technical Marketing Red Hat Storage. Karan Singh Sr. Storage Architect Red Hat Storage

Installation runbook for Hedvig + Cinder Driver

Accelerate OpenStack* Together. * OpenStack is a registered trademark of the OpenStack Foundation

Build your own Cloud on Christof Westhues

OpenStack Architecture and Pattern Deployment with Heat. OpenStack Architecture and Pattern Deployment using Heat Ruediger Schulze

Software Defined. All The Way with OpenStack. T. R. Bosworth Senior Product Manager SUSE OpenStack Cloud

OpenStack in 10 minutes with DevStack

Part2: Let s pick one cloud IaaS middleware: OpenStack. Sergio Maffioletti

Unleash the Power of Ceph Across the Data Center

SUBSCRIPTION OVERVIEW

A fields' Introduction to SUSE Enterprise Storage TUT91098

Introduction to Neutron. Network as a Service

A Carrier-Grade Cloud Phone System

vrealize Operations Management Pack for OpenStack 5.0

VSP16. Venafi Security Professional 16 Course 04 April 2016

Overview. SUSE OpenStack Cloud Monitoring

Red Hat OpenStack Platform 10 Product Guide

What s New in Newton. Emily Hugenbruch, Advisory Software

Getting Started with an OpenStackbased Cloud Using SUSE OpenStack Cloud to Run SAP Applications

Reimagining OpenStack*

Deployment Guide for Nuage Networks VSP

Deployment Guide for Nuage Networks VSP

Contrail Cloud Platform Architecture

SICOOB. The Second Largest Linux on IBM System z Implementation in the World. Thiago Sobral. Claudio Kitayama

Contrail Cloud Platform Architecture

"Charting the Course... H8Q14S HPE Helion OpenStack. Course Summary

Huawei FusionSphere 6.0 Technical White Paper on OpenStack Integrating FusionCompute HUAWEI TECHNOLOGIES CO., LTD. Issue 01.

Samba HA Cluster on SLES 9

Fuel VMware DVS plugin testing documentation

IO110: Open Enterprise Server 2. Hardware you can hit with a hammer, software you can only curse at...

Getting Started with OpenStack

Setting Up the IBM Cloud Manager with OpenStack Cloud Manager Appliance (CMA) on z/vm

Helion OpenStack Carrier Grade 4.0 RELEASE NOTES

Choosing an Interface

An Introduction to Red Hat Enterprise Linux OpenStack Platform. Rhys Oxenham Field Product Manager, Red Hat

Building a Video Optimized Private Cloud Platform on Cisco Infrastructure Rohit Agarwalla, Technical

POWERED BY OPENSTACK. Powered by OpenStack. Globo.Tech GloboTech Communications

OpenStack End User Guide. SUSE OpenStack Cloud 8

VMware Integrated OpenStack User Guide. VMware Integrated OpenStack 4.1

From GIT to a custom OS image in a few click OS image made easy

NET1821BU THE FUTURE OF NETWORKING AND SECURITY WITH NSX-T Bruce Davie CTO, APJ 2

DEPLOYING NFV: BEST PRACTICES

HPE Helion OpenStack Carrier Grade 1.1 Release Notes HPE Helion

Cloud Computing. Amazon Web Services (AWS)

HP XP Storage Cinder Volume Driver Software User Guide

BRKDCT-1253: Introduction to OpenStack Daneyon Hansen, Software Engineer

Installation runbook for

Saving Your Bacon Recovering From Common Linux Startup Failures

Case Study on Enterprise Private Cloud

Architecture and terminology

Transcription:

Troubleshooting Your SUSE TUT6113 Cloud Paul Thompson SUSE Technical Consultant Dirk Müller SUSE OpenStack Engineer

2 SUSE Cloud...

SUSE Cloud Troubleshooting 3

SUSE Cloud 4653 Parameters 4

SUSE Cloud 14 Components 5

SUSE Cloud 2 Hours 6

SUSE Cloud Troubleshooting <1 Hour 7

SUSE Cloud Building Blocks Management Billling VM SUSE Mgmt Image SUSE Tool Portal App Manager Studio Monitor Sec & Perf Cloud Dashboard (Horizon) Cloud APIs Heat Telemetry RadosGW Required Services Message RabbitMQQ Postgresql Database Compute (Nova) Hypervisor Xen, KVM Vmware, HyperV Vmware, HyperV AUTH (Keystone) Images (Glance ) Object (Swift) Network (Neutron) Adapters Block (Cinder) Adapters RBD SUSE Linux Operating Enterprise System Server 11 SP3 Rados Physical Infrastructure: x86-64, Switches, Storage OpenStack SUSE Management Cloud AddsTools SUSE OS and Product Hypervisor Partner Physical Solutions Infrastructure Ceph 8

Non-HA SUSE Cloud Installation 9

HA SUSE Cloud Installation 10

Just Enough HA for Troubleshooting crm resource list crm_mon crm resource restart <X> crm resource cleanup <X> 11

More About HA... https://www.suse.com/documentation/sle_ha/ 12

SUSE Cloud Functional Blocks Crowbar Nova Glance. Chef OpenStack Cinder Keystone Neutron SLES 13

14 Crowbar and Chef

Generic SLES Troubleshooting All Nodes in SUSE Watch out for typical issues: Cloud are SLES11 SP3 dmesg for hardware-related errors, OOM, interesting kernel messages usual syslog targets, e.g. /var/log/messages Check general node health via: top, vmstat, uptime, pstree, free core files, zombies, etc 15

Supportconfig supportconfig can be run on any cloud node supportutils-plugin-susecloud.rpm installed on all SUSE Cloud nodes automatically collects precious cloud-specific information for further analysis 16

Typical Deployment Schema Admin Node Control Node Chef Client Chef Server Crowbar Provisioner Network Node Compute Compute Compute Cloud Node Chef Client 17

Cloud Install screen install-suse-cloud --verbose /var/log/crowbar/install.log /var/log/crowbar/barclamp_install/*.log 18

SUSE Cloud Admin Node SUSE Cloud Addon Crowbar UI Crowbar Services Chef/Rabbit Repo Mirror Install logs: /var/log/crowbar/install.log Chef/Rabbit: /var/log/rabbitmq/*.log /var/log/chef/server.log /var/log/couchdb/couchdb.log Crowbar repo server: /var/log/apache2/provisioner*log Crowbar: /var/log/crowbar/produ ction.{out,err} SLES 11 SP3 19

Chef Cloud uses Chef for almost everything: All Cloud and SLES non-core packages All config files are overwritten All daemons are started Database tables are initialized http://docs.getchef.com/chef_quick_overview.html 20

Admin Node: Using Chef knife node list knife node show <nodeid> export EDITOR=/usr/bin/vim; \ knife node edit -a <nodeid> node 21

SUSE Cloud Admin Node Populate ~root/.ssh/authorized_keys prior install Barclamp install logs: /var/log/crowbar/barclamp_install Node discovery logs: /var/log/crowbar/sledgehammer/d<macid>.<domain>.log Syslog of crowbar installed nodes sent via rsyslog to: /var/log/nodes/d<macid>.log 22

Useful Tricks Root login to the Cloud installed nodes should be possible from admin node (even in discovery stage) If admin network is reachable: ~/.ssh/config: host 192.168.124.* StrictHostKeyChecking no user root 23

SUSE Cloud Admin Node If a proposal is applied, chef client logs are at: /var/log/crowbar/chef-client/<macid>.<domain>.log Useful crowbar commands: crowbar machines help crowbar transition <node> <state> crowbar <barclamp> proposal list show <name> crowbar <barclamp> proposal delete default 24

Admin Node: Crowbar Services Nodes are deployed via PXE boot: /srv/tftpboot/discovery/pxelinux.cfg/* Installed via AutoYaST; profile generated to: /srv/tftpboot/nodes/d<mac>.<domain>/autoyast.xml Can delete & rerun chef-client on the admin node Can add useful settings to autoyast.xml: <confirm config:type="boolean">true</confirm> (don t forget to chattr +i the file) 25

Admin Node: Crowbar UI Useful Export Page available in the Crowbar UI in order to export various log files from a customer installation 26

Admin Node: Crowbar UI Raw settings in barclamp proposals allow access to "expert" (hidden) options Most interesting are: debug: true verbose: true 27

28 Admin Node: Crowbar Gotchas

Admin Node: Crowbar Gotchas Be patient Do not multiple transition nodes from one state to another Do not apply proposals while a proposal is applying Cloud nodes should boot from: 1. Network 2. First disk 29

SUSE Cloud Nodes SUSE Cloud Addon All managed via Chef: /var/log/chef/client.log rcchef-client status Cloud Node Node specific services chef-client can be invoked manually Should lock each other if maintenance updates are installed Chef Client SLES 11 SP3 30

SUSE Cloud Control Node SUSE Cloud Addon OpenStack API services.. Chef Client Just like any other cloud node: /var/log/chef/client.log rcchef-client status chef-client Control Node Chef overwrites all config files it touches chattr +i is your friend SLES 11 SP3 31

32 OpenStack Architecture Diagram

OpenStack Block diagram Accesses almost everything Keystone: SPOF 33

OpenStack Architecture Typically each OpenStack component provides: an API daemon / service one or many backend daemons that do the actual work command line client to access the API <proj>-manage client for admin-only functionality dashboard ("Horizon") Plugin providing a graphical view on the service uses an SQL database for storing state 34

OpenStack Packaging Basics Packages are usually named: openstack-<codename> usually a subpackage for each service (-api, -scheduler, etc) log to /var/log/<codename>/<service>.log each service has an init script: dde-ad-be-ff-00-01:~# rcopenstack-glance-api status Checking for service glance-api...running 35

OpenStack Debugging Basics Log files often lack useful information without verbose enabled TRACEs of processes are not logged without verbose Many reasons for API error messages are not logged unless debug is turned on Debug is very verbose (>10GB per hour) https://ask.openstack.org/ http://docs.openstack.org/icehouse/ 36

OpenStack Architecture Accesses almost everything Keystone: SPOF 37

OpenStack Dashboard: Horizon /var/log/apache2/openstack -dashboard-error_log Get the exact URL it tries to access! Enable debug in Horizon barclamp Test components individually 38

OpenStack Identity: Keystone Needed to access all services Needed by all services for checking authorisation Use keystone token-get to validate credentials and test service availability 39

OpenStack Object Store: Swift swift stat swift dispersion in Crowbar uses regular syslog for many messages: /var/log/messages console easiest to debug using curl 40

OpenStack Imaging: Glance To validate lifeness: glance image-list glance image-download <id> > /dev/null glance image-show <id> 41

OpenStack Networking: Neutron Swiss Army knife for SDN neutron agent-list neutron net-list neutron port-list neutron router-list There's no neutron-manage 42

43 Basic Network Layout

Networking with OVS: Compute Node http://docs.openstack.org/havana/config-reference/content/under_the_hood_openvswitch.html 44

45 Networking with LB: Compute Node

Neutron Troubleshooting Neutron uses IP Networking Namespaces on the Network node for routing overlapping networks neutron net-list ip netns list ip netns exec qrouter-<id> bash ping.. arping.. ip ro.. curl.. 46

OpenStack Compute: Nova nova-manage service list nova-manage logs errors nova show <id> with admin privileges shows compute node virsh list or virsh dumpxml can be used to analyze state of VM 47

Nova Overview API Scheduler Conductor Compute Compute Compute "Launches" go to Scheduler; rest to Conductor 48

49 Nova Booting VM Workflow

Nova: Scheduling a VM Nova scheduler tries to select a matching compute node for the VM 50

Nova Scheduler Typical errors: No suitable compute node can be found All suitable compute nodes failed to launch the VM with the required settings nova-manage logs errors INFO nova.filters [req-299bb909-49bc-4124-8b88-732797250cf5 c24689acd6294eb8bbd14121f68d5b44 acea50152da04249a047a52e6b02a2ef] Filter RamFilter returned 0 hosts 51

Nova Scheduler Typical errors: No suitable compute node can be found All suitable compute nodes failed to launch the VM with the required settings nova-manage logs errors INFO nova.filters [req-299bb909-49bc-4124-8b88-732797250cf5 c24689acd6294eb8bbd14121f68d5b44 acea50152da04249a047a52e6b02a2ef] Filter RamFilter returned 0 hosts 52

OpenStack Volumes: Cinder Scheduler API Volume Volume Volume Volume 53

OpenStack Cinder: Volumes Similar syntax to Nova: cinder-manage service list cinder-manage logs errors cinder-manage host list cinder list show (with admin privs) shows volume-host 54

Troubleshooting Cloud-Init OpenStack Services like Heat or Nova depend on cloud-init sets host name, ssh keys, resizes disks, launches custom scripts on boot Heat uses scripts to launch cfntools use curl on the metadata server inside the VM /var/lib/cloud/ /var/log/cloud-init.log 55

Q&A http://ask.openstack.org/ http://docs.openstack.org/ https://www.suse.com/documentation/suse-cloud4/ Thank you 56

Bonus Material

58 OpenStack Orchestration: Heat

OpenStack Orchestration: Heat Uses Nova, Cinder, Neutron to assemble complete stacks of resources heat stack-list heat resource-list show <stack> heat event-list show <stack> Usually necessary to query the actual OpenStack service for further information 59

OpenStack Imaging: Glance Usually issues are in the configured glance backend itself (e.g. RBD, swift, filesystem) so debugging concentrates on those Filesytem: RBD: /var/lib/glance/images ceph -w rbd -p <pool> ls 60

SUSE Cloud 61

Unpublished Work of SUSE. All Rights Reserved. This work is an unpublished work and contains confidential, proprietary, and trade secret information of SUSE. Access to this work is restricted to SUSE employees who have a need to know to perform tasks within the scope of their assignments. No part of this work may be practiced, performed, copied, distributed, revised, modified, translated, abridged, condensed, expanded, collected, or adapted without the prior written consent of SUSE. Any use or exploitation of this work without authorization could subject the perpetrator to criminal and civil liability. General Disclaimer This document is not to be construed as a promise by any participating company to develop, deliver, or market a product. It is not a commitment to deliver any material, code, or functionality, and should not be relied upon in making purchasing decisions. SUSE makes no representations or warranties with respect to the contents of this document, and specifically disclaims any express or implied warranties of merchantability or fitness for any particular purpose. The development, release, and timing of features or functionality described for SUSE products remains at the sole discretion of SUSE. Further, SUSE reserves the right to revise this document and to make changes to its content, at any time, without obligation to notify any person or entity of such revisions or changes. All SUSE marks referenced in this presentation are trademarks or registered trademarks of Novell, Inc. in the United States and other countries. All third-party trademarks are the property of their respective owners.