<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Tiger Technologies Blog &#187; System Status</title>
	<atom:link href="http://blog.tigertech.net/category/system-status/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.tigertech.net</link>
	<description>Behind the scenes at tigertech.net</description>
	<lastBuildDate>Thu, 23 May 2013 06:18:21 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.5.1</generator>
		<item>
		<title>Brief scheduled maintenance on web05 &amp; web07 servers May 22, 2013</title>
		<link>http://blog.tigertech.net/posts/maintenance-20130522/</link>
		<comments>http://blog.tigertech.net/posts/maintenance-20130522/#comments</comments>
		<pubDate>Tue, 21 May 2013 06:08:41 +0000</pubDate>
		<dc:creator>Robert Mathews</dc:creator>
				<category><![CDATA[System Status]]></category>
		<category><![CDATA[maintenance]]></category>
		<category><![CDATA[status]]></category>
		<category><![CDATA[web05]]></category>
		<category><![CDATA[web07]]></category>

		<guid isPermaLink="false">http://blog.tigertech.net/?p=3168</guid>
		<description><![CDATA[Between 10:00 PM and 10:59 PM Pacific time Wednesday May 22, 2013, the &#8220;web05&#8221; and &#8220;web07&#8221; servers will be restarted. This will cause an eight minute interruption of service for each server at some point during this hour. Other servers will not be affected. Mail for customers on these servers will be queued and delivered [...]]]></description>
				<content:encoded><![CDATA[<p>Between 10:00 PM and 10:59 PM Pacific time Wednesday May 22, 2013, the &#8220;<a href="/posts/which-server">web05</a>&#8221; and &#8220;<a href="/posts/which-server">web07</a>&#8221; servers will be restarted. This will cause an eight minute interruption of service for each server at some point during this hour.</p>
<p><span id="more-3168"></span></p>
<p>Other servers will not be affected. Mail for customers on these servers will be queued and delivered after a short delay.</p>
<p>We apologize for the inconvenience this causes. This is necessary to allow our technicians to perform hardware maintenance on these servers, which includes replacement of RAID array disks and the addition of RAM memory.</p>
<p><i>Update 11:15 PM:</i> Both servers were upgraded. The upgrade of web05 took longer than expected, for which we apologize.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.tigertech.net/posts/maintenance-20130522/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>High load on web04 server May 9 2013 (resolved)</title>
		<link>http://blog.tigertech.net/posts/status-201305090800/</link>
		<comments>http://blog.tigertech.net/posts/status-201305090800/#comments</comments>
		<pubDate>Thu, 09 May 2013 15:00:57 +0000</pubDate>
		<dc:creator>Robert Mathews</dc:creator>
				<category><![CDATA[System Status]]></category>
		<category><![CDATA[status]]></category>
		<category><![CDATA[web04]]></category>

		<guid isPermaLink="false">https://blog.tigertech.net/?p=3151</guid>
		<description><![CDATA[The &#8220;web04&#8221; server experienced extremely high load for several minutes beginning at 8:00 AM Pacific time on May 9. Sites on this server were slow or unavailable as a result. This was caused by a single site making &#8220;runaway&#8221; database queries that left almost no MySQL &#8220;cache&#8221; memory available for other queries. The problem has [...]]]></description>
				<content:encoded><![CDATA[<p>The &#8220;<a href="/posts/which-server">web04</a>&#8221; server experienced extremely high load for several minutes beginning at 8:00 AM Pacific time on May 9. Sites on this server were slow or unavailable as a result.</p>
<p><span id="more-3151"></span></p>
<p>This was caused by a single site making &#8220;runaway&#8221; database queries that left almost no MySQL &#8220;cache&#8221; memory available for other queries. The problem has been resolved by suspending the site involved, and we are analyzing how to prevent anything similar from happening in the future.</p>
<p>We apologize for this incident; we take reliability seriously and strive to avoid this kind of problem.</p>
<p><em>Followup: We have made a technical change that will prevent this from recurring. Our MySQL servers are configured to write temporary tables to /dev/shm, which defaults to 12 GB in size on our 24 GB RAM servers. This effectively allowed runaway queries to use up to 12 GB of RAM, emptying much of  server&#8217;s general file cache. We have lowered the size of /dev/shm to a maximum of 6 GB, ensuring that the file cache doesn&#8217;t empty out and cause load spikes.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://blog.tigertech.net/posts/status-201305090800/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Slow performance on web04 server April 11, 2013 (resolved)</title>
		<link>http://blog.tigertech.net/posts/status-201304111331/</link>
		<comments>http://blog.tigertech.net/posts/status-201304111331/#comments</comments>
		<pubDate>Thu, 11 Apr 2013 20:31:21 +0000</pubDate>
		<dc:creator>System Status</dc:creator>
				<category><![CDATA[System Status]]></category>
		<category><![CDATA[status]]></category>
		<category><![CDATA[web04]]></category>

		<guid isPermaLink="false">https://blog.tigertech.net/?p=3116</guid>
		<description><![CDATA[1:31 PM Pacific time: Our technicians are investigating high load and slow page load times on the &#8220;web04&#8221; server. 2:09 PM Pacific time: This is being caused by a distributed denial of service attack on WordPress sites that is causing outages for many companies. We&#8217;re working to block it. 2:36 PM Pacific time: The attack [...]]]></description>
				<content:encoded><![CDATA[<p><em>1:31 PM Pacific time</em>: Our technicians are investigating high load and slow page load times on the &#8220;<a href="/posts/which-server">web04</a>&#8221; server.</p>
<p><em>2:09 PM Pacific time</em>: This is being caused by a <a href="http://www.webhostingtalk.com/showthread.php?t=1255387">distributed denial of service attack on WordPress sites</a> that is causing outages for many companies. We&#8217;re working to block it.</p>
<p><span id="more-3116"></span></p>
<p><em>2:36 PM Pacific time</em>: The attack has been successfully blocked and all services are now working normally.</p>
<p>We sincerely apologize for the inconvenience this problem caused our customers.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.tigertech.net/posts/status-201304111331/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Outage on web12 server April 9, 2013 (resolved)</title>
		<link>http://blog.tigertech.net/posts/status-201304091301/</link>
		<comments>http://blog.tigertech.net/posts/status-201304091301/#comments</comments>
		<pubDate>Tue, 09 Apr 2013 20:01:13 +0000</pubDate>
		<dc:creator>System Status</dc:creator>
				<category><![CDATA[System Status]]></category>
		<category><![CDATA[all servers]]></category>
		<category><![CDATA[status]]></category>
		<category><![CDATA[web12]]></category>

		<guid isPermaLink="false">https://blog.tigertech.net/?p=3110</guid>
		<description><![CDATA[Between 12:50 and 1:23 PM Pacific time, service was intermittently unavailable or slow for sites and e-mail on the web12 server. In addition, customers on other servers may have seen brief delays or high load for about two minutes during this period. This problem was caused by a brief period of high network latency to [...]]]></description>
				<content:encoded><![CDATA[<p>Between 12:50 and 1:23 PM Pacific time, service was intermittently unavailable or slow for sites and e-mail on the <a href="/posts/which-server">web12</a> server. In addition, customers on other servers may have seen brief delays or high load for about two minutes during this period.</p>
<p><span id="more-3110"></span></p>
<p>This problem was caused by a brief period of high network latency to some destinations. That caused a larger-than-usual number of PHP processes to start, leading to reduced memory available for file system caching. This in turn made the server respond more slowly than usual, which caused even more PHP processes to start to handle the incoming requests. This made the problem worse in a &#8220;vicious circle&#8221; until we could manually limit the number of PHP processes being started.</p>
<p>The web12 server appears to be more vulnerable to this problem than other servers because of its PHP script usage pattern. While the number of PHP processes on all our servers increased, the problem was just bad enough on web12 that it couldn&#8217;t recover from it gracefully. We haven&#8217;t seen this particular issue happen before.</p>
<p>We are making immediate changes to the way PHP processes are started and limited to ensure this problem does not recur.</p>
<p>We sincerely apologize for this. We know you count on us for reliable service, and we are constantly striving to avoid this kind of problem.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.tigertech.net/posts/status-201304091301/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Network outage March 23 2013 (resolved)</title>
		<link>http://blog.tigertech.net/posts/status-201303232320/</link>
		<comments>http://blog.tigertech.net/posts/status-201303232320/#comments</comments>
		<pubDate>Sun, 24 Mar 2013 06:20:47 +0000</pubDate>
		<dc:creator>System Status</dc:creator>
				<category><![CDATA[System Status]]></category>
		<category><![CDATA[all servers]]></category>
		<category><![CDATA[status]]></category>

		<guid isPermaLink="false">https://blog.tigertech.net/?p=3103</guid>
		<description><![CDATA[Between 11:04 PM and 11:44 PM March 23, our network was either slow to respond due to high packet loss or completely unavailable to some customers. This was due to a hardware failure at one of our upstream network partner companies. To work around the problem, our technicians had to manually close down the network [...]]]></description>
				<content:encoded><![CDATA[<p>Between 11:04 PM and 11:44 PM March 23, our network was either slow to respond due to high packet loss or completely unavailable to some customers.</p>
<p><span id="more-3103"></span></p>
<p>This was due to a hardware failure at one of our upstream network partner companies. To work around the problem, our technicians had to manually close down the network session with that company to re-route all network traffic.</p>
<p>The problem has now been completely resolved. The upstream provider located and has replaced the faulty equipment. We sincerely apologize for the trouble this caused our customers.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.tigertech.net/posts/status-201303232320/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Brief performance problem on web12 server March 4, 2013 (resolved)</title>
		<link>http://blog.tigertech.net/posts/web12-20130304/</link>
		<comments>http://blog.tigertech.net/posts/web12-20130304/#comments</comments>
		<pubDate>Mon, 04 Mar 2013 18:30:11 +0000</pubDate>
		<dc:creator>Robert Mathews</dc:creator>
				<category><![CDATA[System Status]]></category>
		<category><![CDATA[mysql]]></category>
		<category><![CDATA[status]]></category>
		<category><![CDATA[web12]]></category>

		<guid isPermaLink="false">http://blog.tigertech.net/?p=3077</guid>
		<description><![CDATA[There was a brief but severe performance problem on the web12 server today between 9:59 and 10:07 AM Pacific time. During this time, many Web server requests were very slow to load or even &#8220;timed out&#8221; completely. All services are now operating normally again. Other servers were not affected. This problem was caused by high [...]]]></description>
				<content:encoded><![CDATA[<p>There was a brief but severe performance problem on the <a href="/posts/which-server">web12</a> server today between 9:59 and 10:07 AM Pacific time. During this time, many Web server requests were very slow to load or even &#8220;timed out&#8221; completely. All services are now operating normally again. Other servers were not affected.</p>
<p><span id="more-3077"></span></p>
<p>This problem was caused by high database load due to a customer making &#8220;runaway&#8221; database queries on that server. We are investigating the details to avoid future problems, and we apologize to affected customers.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.tigertech.net/posts/web12-20130304/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Brief scheduled maintenance February 26 2013 (completed)</title>
		<link>http://blog.tigertech.net/posts/maintenance-2013022/</link>
		<comments>http://blog.tigertech.net/posts/maintenance-2013022/#comments</comments>
		<pubDate>Tue, 26 Feb 2013 23:53:04 +0000</pubDate>
		<dc:creator>Robert Mathews</dc:creator>
				<category><![CDATA[System Status]]></category>
		<category><![CDATA[all servers]]></category>
		<category><![CDATA[maintenance]]></category>
		<category><![CDATA[security]]></category>
		<category><![CDATA[status]]></category>

		<guid isPermaLink="false">http://blog.tigertech.net/?p=3068</guid>
		<description><![CDATA[Between 11:00 PM and 11:59 PM Pacific time February 26, 2013, each of our servers will be restarted for a &#8220;kernel upgrade&#8221;. This will cause an approximately four minute interruption of service for each customer at some point during this hour. During that four minute period, customers will not be able to use their Web [...]]]></description>
				<content:encoded><![CDATA[<p>Between 11:00 PM and 11:59 PM Pacific time February 26, 2013, each of our servers will be restarted for a &#8220;kernel upgrade&#8221;. This will cause an approximately four minute interruption of service for each customer at some point during this hour.</p>
<p><span id="more-3068"></span></p>
<p>During that four minute period, customers will not be able to use their Web sites or read e-mail. E-mail that arrives during this period will be queued and redelivered after the maintenance, not lost.</p>
<p>This maintenance and restart of all servers is necessary for <a href="http://www.debian.org/security/2013/dsa-2632">security reasons</a>. We apologize for the inconvenience this causes, together with the short notice (this issue is important enough that the patch should be applied as soon as it&#8217;s available, which is today).</p>
<p><em>Update 11:30 PM Pacific time: The maintenance was completed with less than 3 minutes downtime for all servers except <a href="/posts/which-server">web01</a>, which took a few minutes longer because the Apache software on that server needed manually starting due to a technical problem that we have permanently resolved.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://blog.tigertech.net/posts/maintenance-2013022/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Outage on web12 server (resolved)</title>
		<link>http://blog.tigertech.net/posts/status-201302071835/</link>
		<comments>http://blog.tigertech.net/posts/status-201302071835/#comments</comments>
		<pubDate>Fri, 08 Feb 2013 02:35:56 +0000</pubDate>
		<dc:creator>System Status</dc:creator>
				<category><![CDATA[System Status]]></category>
		<category><![CDATA[status]]></category>
		<category><![CDATA[web12]]></category>

		<guid isPermaLink="false">https://blog.tigertech.net/?p=3040</guid>
		<description><![CDATA[There was a brief outage on the web12 server today starting at about 6:22 PM Pacific time. This was caused by a “SYN flood” attack, which effectively blocked all other connections with the server. We took steps to work around the attack, which we completed by 7:08 PM Pacific time (46 minutes after the start [...]]]></description>
				<content:encoded><![CDATA[<p>There was a brief outage on the <a href="/posts/which-server">web12</a> server today starting at about 6:22 PM Pacific time. This was caused by a “SYN flood” attack, which effectively blocked all other connections with the server.</p>
<p>We took steps to work around the attack, which we completed by 7:08 PM Pacific time (46 minutes after the start of the attack). Furthermore, the attack itself seems to have stopped; the steps we took should help in case in starts again.</p>
<p>We sincerely apologize for the interruption in service for those affected customers; we know that reliable service is a primary concern for all of our customers.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.tigertech.net/posts/status-201302071835/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>web03 server restarted (resolved)</title>
		<link>http://blog.tigertech.net/posts/web03-server-20130206/</link>
		<comments>http://blog.tigertech.net/posts/web03-server-20130206/#comments</comments>
		<pubDate>Thu, 07 Feb 2013 06:05:19 +0000</pubDate>
		<dc:creator>Robert Mathews</dc:creator>
				<category><![CDATA[System Status]]></category>
		<category><![CDATA[status]]></category>
		<category><![CDATA[web03]]></category>

		<guid isPermaLink="false">http://blog.tigertech.net/?p=3038</guid>
		<description><![CDATA[At 9:45 PM Pacific time February 6 2013, our &#8220;web03&#8221; server experienced a &#8220;kernel panic&#8221; and needed to be restarted. This led to an 11 minute outage of Web sites and e-mail hosted on that server. All services are now working normally, and other servers were not affected. We apologize for the trouble this caused [...]]]></description>
				<content:encoded><![CDATA[<p>At 9:45 PM Pacific time February 6 2013, our &#8220;<a href="/posts/which-server">web03</a>&#8221; server experienced a &#8220;kernel panic&#8221; and needed to be restarted. This led to an 11 minute outage of Web sites and e-mail hosted on that server.</p>
<p>All services are now working normally, and other servers were not affected. We apologize for the trouble this caused customers on the web03 server.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.tigertech.net/posts/web03-server-20130206/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Denial of service attack February 5, 2013 (resolved)</title>
		<link>http://blog.tigertech.net/posts/status-201302051504/</link>
		<comments>http://blog.tigertech.net/posts/status-201302051504/#comments</comments>
		<pubDate>Tue, 05 Feb 2013 23:04:18 +0000</pubDate>
		<dc:creator>Robert Mathews</dc:creator>
				<category><![CDATA[System Status]]></category>
		<category><![CDATA[all servers]]></category>
		<category><![CDATA[status]]></category>
		<category><![CDATA[web11]]></category>

		<guid isPermaLink="false">https://blog.tigertech.net/?p=3029</guid>
		<description><![CDATA[Beginning at 3:00 PM Pacific time February 5, a server on our network was the target of an extremely high volume DNS amplification denial of service attack. The inbound network data exceeded 11.6 Gbps, which is an extremely large amount &#8212; large enough to exceed the 10 Gpbs capacity of our upstream Ethernet switches and [...]]]></description>
				<content:encoded><![CDATA[<p>Beginning at 3:00 PM Pacific time February 5, a server on our network was the target of an extremely high volume DNS amplification denial of service attack. The inbound network data exceeded 11.6 Gbps, which is an extremely large amount &#8212; large enough to exceed the 10 Gpbs capacity of our upstream Ethernet switches and cause our entire network to slow down dramatically.</p>
<p>This affected all servers for about 19 minutes, until we and our network partners began discarding (&#8220;null routing&#8221;) all traffic targeted at that server. This fixed the problem for the rest of our network, but still left sites on the &#8220;web11&#8243; server unavailable.</p>
<p>To solve that, the IP addresses of all sites on the web11 server have been changed to new IP addresses that are working correctly and are not under attack. This was completed by 3:44 PM, and all sites on all servers are now working properly.</p>
<p>If the attackers target another IP address, we&#8217;re ready to immediately block that one, too. If that does happen, the way we&#8217;ve redistributed the IP addresses, in combination with previous analysis we&#8217;ve done on this attack, will allow us to immediately know which site is under attack. (It&#8217;s otherwise hard to determine which IP address is involved, because the type of attack we&#8217;re seeing targets only an IP address and not a specific Web site name.) That site will then be moved off our main network to prevent a recurrence.</p>
<p>We sincerely apologize for the inconvenience this caused our customers; we know you count on us for reliable service, and we&#8217;re committed to doing everything possible to avoid problems.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.tigertech.net/posts/status-201302051504/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>

<!-- Dynamic page generated in 3.709 seconds. -->
<!-- Cached page generated by WP-Super-Cache on 2013-05-22 23:53:10 -->
