<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>holyhandgrenade.org &#187; monitoring</title>
	<atom:link href="http://holyhandgrenade.org/blog/tag/monitoring/feed/" rel="self" type="application/rss+xml" />
	<link>http://holyhandgrenade.org/blog</link>
	<description>Got my two fingers out the roof see me greppin&#039; out</description>
	<lastBuildDate>Tue, 29 Nov 2011 19:44:57 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Monitoring Windows MPIO through Nagios</title>
		<link>http://holyhandgrenade.org/blog/2010/05/monitoring-windows-mpio-through-nagios/</link>
		<comments>http://holyhandgrenade.org/blog/2010/05/monitoring-windows-mpio-through-nagios/#comments</comments>
		<pubDate>Sun, 30 May 2010 18:08:44 +0000</pubDate>
		<dc:creator>Jeff</dc:creator>
				<category><![CDATA[Sysadmin]]></category>
		<category><![CDATA[monitoring]]></category>
		<category><![CDATA[nagios]]></category>
		<category><![CDATA[san]]></category>
		<category><![CDATA[windows]]></category>

		<guid isPermaLink="false">http://holyhandgrenade.org/blog/?p=615</guid>
		<description><![CDATA[<a href="http://holyhandgrenade.org/blog/2010/05/monitoring-windows-mpio-through-nagios/" title="Monitoring Windows MPIO through Nagios"></a>Sometimes, we need to do SAN maintenance &#8212; firmware upgrades, disruptive fabric changes, and the like. When these situations come up, it&#8217;s useful to know if anything is in a condition where it will break if it loses its connection &#8230;<p class="read-more"><a href="http://holyhandgrenade.org/blog/2010/05/monitoring-windows-mpio-through-nagios/">Continue reading &#187;</a></p>]]></description>
			<content:encoded><![CDATA[<a href="http://holyhandgrenade.org/blog/2010/05/monitoring-windows-mpio-through-nagios/" title="Monitoring Windows MPIO through Nagios"></a><p>Sometimes, we need to do SAN maintenance &#8212; firmware upgrades, disruptive fabric changes, and the like. When these situations come up, it&#8217;s useful to know if anything is in a condition where it will break if it loses its connection to SAN storage, especially if you&#8217;re a lowly storage administrator without admin access to any of the Windows systems connected up to the SAN.</p>
<p>I poked around, and could not find one single utility or tool for monitoring the Windows MPIO framework, so I whipped up a quick script using VBScript and WMI. The script is called like so:</p>
<p style="padding-left: 30px;">cscript.exe //NoLogo scripts\CheckMpioPaths.vbs /paths 4</p>
<p>(4 paths are used because the server is multipathed on two fabrics, and each of the active/passive controllers is also on each fabric &#8212; the server should see 2 controllers on 2 fabrics each, for 4 paths.)</p>
<p>This will cause the script to issue a Nagios CRITICAL if any multipath-registered LUN shows fewer than the given number of paths.</p>
<p>As usual, you can find the script in the <a href="http://github.com/jgoldschrafe/CheckMpioPaths">GitHub repository for CheckMpioPaths</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://holyhandgrenade.org/blog/2010/05/monitoring-windows-mpio-through-nagios/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Resilient infrastructures are only useful if they actually stay resilient</title>
		<link>http://holyhandgrenade.org/blog/2010/05/resilient-infrastructures-are-only-useful-if-they-actually-stay-resilient/</link>
		<comments>http://holyhandgrenade.org/blog/2010/05/resilient-infrastructures-are-only-useful-if-they-actually-stay-resilient/#comments</comments>
		<pubDate>Mon, 24 May 2010 21:32:53 +0000</pubDate>
		<dc:creator>Jeff</dc:creator>
				<category><![CDATA[Sysadmin]]></category>
		<category><![CDATA[ha]]></category>
		<category><![CDATA[monitoring]]></category>

		<guid isPermaLink="false">http://holyhandgrenade.org/blog/?p=591</guid>
		<description><![CDATA[<a href="http://holyhandgrenade.org/blog/2010/05/resilient-infrastructures-are-only-useful-if-they-actually-stay-resilient/" title="Resilient infrastructures are only useful if they actually stay resilient"></a>Ask yourself a question: for every piece of resiliency you supposedly have in your network, are you really positive that it&#8217;s not running in a degraded state? Really, really sure? Sometimes, it&#8217;s basic: are you being alerted when any disk &#8230;<p class="read-more"><a href="http://holyhandgrenade.org/blog/2010/05/resilient-infrastructures-are-only-useful-if-they-actually-stay-resilient/">Continue reading &#187;</a></p>]]></description>
			<content:encoded><![CDATA[<a href="http://holyhandgrenade.org/blog/2010/05/resilient-infrastructures-are-only-useful-if-they-actually-stay-resilient/" title="Resilient infrastructures are only useful if they actually stay resilient"></a><p>Ask yourself a question: for every piece of resiliency you supposedly have in your network, are you really positive that it&#8217;s not running in a degraded state? Really, <em>really</em> sure?</p>
<p>Sometimes, it&#8217;s basic: are you being alerted when any disk array attached to any server suffers a disk failure?</p>
<p>Very often, it&#8217;s not: for your SAN-attached systems, are you positive that the multipathing is green? If you&#8217;re connected to two storage processors or controllers, can the server see two paths to each of them? Are you getting alerted if you can&#8217;t?</p>
<p>Are your port channels running over the number of links that they&#8217;re supposed to? How about the ISLs on your FC fabrics?</p>
<p>If you have failover clusters where services run on preferred nodes, are you sure they&#8217;re actually located where they&#8217;re supposed to be? Are you monitoring that services are all running on their preferred nodes?</p>
<p>If you have asymmetric fall-back connections, like a gigabit switch uplink used to back up a 10-gigabit switch uplink, are you notified when it&#8217;s using the backup connection, or do you rely on your users to tell you that things seem to be running slowly?</p>
<p>There&#8217;s a difference between things running, and things running <em>smoothly</em>: making sure that your &#8220;redundant&#8221; equipment and services are actually redundant is the key to keeping issues from turning into problems.</p>
]]></content:encoded>
			<wfw:commentRss>http://holyhandgrenade.org/blog/2010/05/resilient-infrastructures-are-only-useful-if-they-actually-stay-resilient/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Charting performance data for IBM Midrange Storage Series SANs with PNP4Nagios</title>
		<link>http://holyhandgrenade.org/blog/2010/05/charting-performance-data-for-ibm-midrange-storage-series-sans-with-pnpnagios/</link>
		<comments>http://holyhandgrenade.org/blog/2010/05/charting-performance-data-for-ibm-midrange-storage-series-sans-with-pnpnagios/#comments</comments>
		<pubDate>Mon, 24 May 2010 16:08:49 +0000</pubDate>
		<dc:creator>Jeff</dc:creator>
				<category><![CDATA[Sysadmin]]></category>
		<category><![CDATA[ibm]]></category>
		<category><![CDATA[monitoring]]></category>
		<category><![CDATA[nagios]]></category>
		<category><![CDATA[pnp4nagios]]></category>
		<category><![CDATA[san]]></category>

		<guid isPermaLink="false">http://holyhandgrenade.org/blog/?p=579</guid>
		<description><![CDATA[<a href="http://holyhandgrenade.org/blog/2010/05/charting-performance-data-for-ibm-midrange-storage-series-sans-with-pnpnagios/" title="Charting performance data for IBM Midrange Storage Series SANs with PNP4Nagios"></a>If you&#8217;ve used IBM SAN products, particularly the DS4000, DS5000 and DS6000 series (which are rebranded LSI), one of the most obnoxious things about it is how you&#8217;re pretty much forced to roll your own monitoring tools. Compared to many &#8230;<p class="read-more"><a href="http://holyhandgrenade.org/blog/2010/05/charting-performance-data-for-ibm-midrange-storage-series-sans-with-pnpnagios/">Continue reading &#187;</a></p>]]></description>
			<content:encoded><![CDATA[<a href="http://holyhandgrenade.org/blog/2010/05/charting-performance-data-for-ibm-midrange-storage-series-sans-with-pnpnagios/" title="Charting performance data for IBM Midrange Storage Series SANs with PNP4Nagios"></a><p>If you&#8217;ve used IBM SAN products, particularly the DS4000, DS5000 and DS6000 series (which are rebranded LSI), one of the most obnoxious things about it is how you&#8217;re pretty much forced to roll your own monitoring tools. Compared to many mainstream vendors (and Sun/Oracle in particular), IBM&#8217;s performance monitoring and modelling tools have been lackluster at best and completely unsupplied at worst. The best tool you&#8217;ve got is the SMcli, which doesn&#8217;t supply a ton of good information, but at least provides you with a starting point for capacity planning.</p>
<p>I had originally wanted to make something like this for Cacti, which probably has a much broader install base than the pnp4nagios addon, but the Nagios way was just so <em>easy</em>, and I&#8217;d like to share it with anyone who doesn&#8217;t want to roll their own basic performance aggregator for it.</p>
<p>This tool gets the following statistics:</p>
<ul>
<li>IOPS</li>
<li>Throughput</li>
<li>Read percentage</li>
<li>Cache hit percentage</li>
</ul>
<p>It gets statistics at the following levels:</p>
<ul>
<li>Logical Unit</li>
<li>Physical Array</li>
<li>Controller</li>
<li>Unit</li>
</ul>
<p>It&#8217;s a little quick-and-dirty, but it works:</p>
<p><img class="alignnone size-medium wp-image-584" title="check_smcli_io" src="http://holyhandgrenade.org/blog/wp-content/uploads/2010/05/check_smcli_io-300x122.png" alt="check_smcli_io" width="300" height="122" /></p>
<p>Like my other projects, it&#8217;s hosted on GitHub, so check out the <a href="http://github.com/jgoldschrafe/check_smcli_io">GitHub project for check_smcli_io</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://holyhandgrenade.org/blog/2010/05/charting-performance-data-for-ibm-midrange-storage-series-sans-with-pnpnagios/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

