<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Computing@Thayer &#187; File Storage</title>
	<atom:link href="http://computing.thayer.dartmouth.edu/blog/category/file-storage/feed/" rel="self" type="application/rss+xml" />
	<link>http://computing.thayer.dartmouth.edu</link>
	<description>The blog of your Friendly Computing Services Team</description>
	<lastBuildDate>Thu, 15 Oct 2009 19:54:48 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.6</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Cheap storage Server &#8211; Part two</title>
		<link>http://computing.thayer.dartmouth.edu/blog/2009/08/26/cheap-storage-server-part-2/</link>
		<comments>http://computing.thayer.dartmouth.edu/blog/2009/08/26/cheap-storage-server-part-2/#comments</comments>
		<pubDate>Thu, 27 Aug 2009 01:18:12 +0000</pubDate>
		<dc:creator>jared</dc:creator>
				<category><![CDATA[File Storage]]></category>

		<guid isPermaLink="false">http://computing.thayer.dartmouth.edu/?p=259</guid>
		<description><![CDATA[Not all bits are created equal
There are many reasons to pay for the reliability, rich feature set, and fail-over capabilities of our NetApp.  The NetApp is an excellent product.  However, there is also a significant amount of data at Thayer that just doesn&#8217;t need all the advanced features.  For instance, we have [...]]]></description>
			<content:encoded><![CDATA[<h3>Not all bits are created equal</h3>
<p>There are many reasons to pay for the reliability, rich feature set, and fail-over capabilities of our NetApp.  The NetApp is an excellent product.  However, there is also a significant amount of data at Thayer that just doesn&#8217;t need all the advanced features.  For instance, we have research groups with multi-Terabyte data sets that just want their data easily accessible.  If it were unavailable occasionally it wouldn&#8217;t be a huge deal.  Other features such as snapshots are not critical.</p>
<p>If we can store these larger data sets on a cheap storage alternative, we can minimize the size and cost of our NetApp.</p>
<h3>Design philosophy</h3>
<p>After stalling for a couple years, hoping something would come along, we have decided to move forward with a plan that makes many compromises, but should meet the basic requirements for many of our storage needs.</p>
<p>We initially started the search to replace the NetApp altogether feature for feature.  However, over time we lowered our standards to requirements that meet most of the needs for simple storage of lots of data.  As our research continued, our philosophy on what product to choose evolved.  In the end, the three main philosophical items we looked for when choosing a cheap storage solution were:</p>
<ol>
<li><strong>Widely used technologies and standards</strong> &#8211; If lots of people are already using the technology it is more likely to be stable.  It also is likely to be around in a few years. While little start ups may have cool technology, we really don&#8217;t want to trust our data to a half-baked product.  Little companies can also get gobbled up or go out business overnight, never to be heard from again.</li>
<li><strong>Healthy communities and companies that are committed to the technology</strong> &#8211;  In case we do have questions, or run into an issue, we want an active community of people that will be able to knowledgeably answers our questions.  It isn&#8217;t enough that the technology says it supports CIFS and Active Directory.  If it doesn&#8217;t work quite right in _our_ environment, and nobody seems to know why, we just can&#8217;t be comfortable moving forward.</li>
<li><strong>Simplicity</strong> &#8211; Simple solutions have less chance of breaking and are easier to fix in the event that they do break.  It is easy to build several layers of redundancy, but end up with a system that is more likely to break because of all the moving pieces.</li>
</ol>
<h3>What we considered</h3>
<h4>ZFS on OpenSolaris/Solaris</h4>
<p><a href="http://sun.com">Link</a></p>
<p>We actually own a Sun Fire X4540 (Thor) running Solaris.  We are currently using it as our online backup for the NetApp.  It seems like a nice piece of hardware and the ZFS feature set is amazing.  Snapshots, filesystem compression, expandability, checksumming, and the rest are all great.</p>
<ul>
<li>Technically a home run.</li>
<li>Doesn&#8217;t score so well on our three design philosophy requirements.</li>
<li>Reasonable, researched questions in forums go unanswered.</li>
<li>Sun Engineers try to be helpful, but can disappear for weeks at a time.</li>
<li>Sun shunned the Samba project and instead decided to implement their own CIFS server.</li>
<li>Projects lack focus.  There are several different editions of Solaris.  They appear to be attempting to clear this up, but I really wish their &#8220;Project Indiana&#8221; broke free of a lot of the legacy issues weighing them down.</li>
<li>Storage systems must be purchased fully populated.</li>
<li>We could not get AD group integration working and ID mapping is complex.</li>
<li>Even the latest build of OpenSolaris makes you feel like you are a Sys Admin from 1999.</li>
<li>Sun laid off many employees.  They have lots of projects they are trying to maintain, but seem to be having trouble keeping up with them.</li>
<li>To make things worse, they got gobbled up by Oracle.  How important Solaris is to Oracle remains to be seen.</li>
</ul>
<h4>Sun Storage 7000 Unified Storage (Amber Road)</h4>
<p><a href="http://www.sun.com/storage/disk_systems/unified_storage/">Link</a></p>
<p>Sun realized there were admins like us who wanted ZFS but didn&#8217;t want to admin Solaris.  So they wisely made a storage appliance.  They did a great job of putting a pretty face on top of ZFS and Dtrace and then went and charged way too much for it.</p>
<ul>
<li>Price is lower than NetApp and EMC, but not low enough to build a decent sized userbase and build some critical mass.</li>
<li>Can&#8217;t replicate from an Amber Road device to our existing Sun Fire X4540.</li>
</ul>
<h4>ZFS on Nexenta</h4>
<p><a href="http://nexenta.com/">Link</a></p>
<p>Nexenta seemed like it could be a good compromise for some of our Solaris concerns.  Like Amber Road, it hides all/most of the Solaris administration and allows you to use ZFS like an appliance.  We set up Nexenta on test systems on multiple occasions and it shows a lot of promise.  Again a home run technically, but falls apart on our design philosophy.</p>
<ul>
<li>The company is very small without a very big user base.</li>
<li>If you have any Solaris specific technical problems, they basically say, &#8220;go talk to Sun&#8221;.</li>
<li>The depth and breadth of their documentation needs some major improvements.</li>
<li>Fairly inexpensive (good educational discounts) but when building cheap storage, every additional dollar pushes up the cost per gigabyte.</li>
<li>They are working like mad to add advanced features like failover.</li>
</ul>
<p>Came very close to giving them a shot, but they just need a little more time to fully bake.</p>
<h4>Clustered Samba</h4>
<p><a href="http://ctdb.samba.org/">Link</a></p>
<p>I hope this project gains momentum as we are really drawn to an open system that we can grow on demand by adding another server full of hard drives as our storage needs grow in fits and starts. </p>
<ul>
<li>Still early and <em>very</em> little documentation</li>
<li>Uses clustered file system which are still maturing and poorly supported on Ubuntu</li>
</ul>
<h4>Netgear ReadyNAS</h4>
<p><a href="http://www.readynas.com">Link</a></p>
<p>Netgear sells the ReadyNAS, a decent Linux based Network Attached Storage.  We bought a 6 bay ReadyNAS Pro which we currently use for network based Time Machine backups for a few Macs.  Bascially we wanted to get some mileage with a Linux based NAS.</p>
<ul>
<li>Decent web management interface with easy AD integration.</li>
<li>Seems to have better throughput than most small business NAS systems.</li>
<li>Tech Support is friendly but very bureaucratic.  We ran into a bug when trying to add user and group permissions to a share.  Getting tech support has been very difficult.  You basically have to give them your first born child in order to be raised to the higher level of support.  They really need to say, &#8220;yes that is a bug, we&#8217;ll fix it and get back to you&#8221;.  Instead, they had me do dozens and dozens of unrelated tweaks to no avail.  When I was finally suppose to get the next level of tech support, my ticket somehow got closed and I basically had to do the dozens of task all over again under a new ticket (which is still unresolved).</li>
<li>The largest product they sell is 12 bays.  Probably not big enough for us unless we used multiple systems.</li>
<li>It uses EXT3 filesystem, so quota is user and group based instead of directory/project based. (can be worked around with lots of groups).</li>
<li>Dollar per gigabyte is very good (currently around $0.53 per usable GB for the 3200 with 12 2TB drives in RAID 6 with 1 hot spare).</li>
</ul>
<h4>Btrfs</h4>
<p><a href="http://btrfs.wiki.kernel.org/index.php/Main_Page">Link</a></p>
<p>A new, GPL filesystem originally developed by Oracle.  All the fancy features that we want, baked into the Linux kernel. Fingers crossed that in a couple years this will have stabilized to the point that it is ready for production use.  It seems like it has the most promise of competing with NetApp&#8217;s WAFL and ZFS.  Oddly enough, Btrfs is now owned by Oracle.  Will Oracle continue to develop both ZFS and Btrfs?</p>
<h4>Other</h4>
<p>We looked at numerous other technologies in varying levels of detail. None seemed to quite have the momentum at this point, and several were just not designed for a typical file server.  We&#8217;ll keep an eye on them.</p>
<p>Some of the projects include:</p>
<ul>
<li><a href="http://www.ibrix.com">IBRIX</a> &#8211; now owned by HP</li>
<li><a href="">LeftHand Networks</a> &#8211; now owned by HP</li>
<li><a href="http://www.cleversafe.org/">Cleversafe</a></li>
<li><a href="http://www.parascale.com/">Parascale</a></li>
<li><a href="http://www.caringo.com/products_castor.html">Caringo CAStor</a></li>
</ul>
<p><a href="http://computing.thayer.dartmouth.edu/blog/2009/08/27/cheap-storage-server-project-part-three/">Read part three</a>&#8230; what we actually chose to go with (warning, it is pretty anticlimactic).</p>
]]></content:encoded>
			<wfw:commentRss>http://computing.thayer.dartmouth.edu/blog/2009/08/26/cheap-storage-server-part-2/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Cheap Storage Project &#8211; Part One</title>
		<link>http://computing.thayer.dartmouth.edu/blog/2009/08/25/cheap-storage-project-1/</link>
		<comments>http://computing.thayer.dartmouth.edu/blog/2009/08/25/cheap-storage-project-1/#comments</comments>
		<pubDate>Wed, 26 Aug 2009 00:55:59 +0000</pubDate>
		<dc:creator>jared</dc:creator>
				<category><![CDATA[File Storage]]></category>

		<guid isPermaLink="false">http://computing.thayer.dartmouth.edu/?p=255</guid>
		<description><![CDATA[Almost from the day our NetApp (ThayerFS) was powered up, we&#8217;ve been on the hunt for a cheaper file server alternative.  We&#8217;re very pleased with the functionality of the NetApp, but dollar per gigabyte and several seemingly arbitrary limitations left us curious of alternative options.
The cost of our NetApp storage is a little fuzzy. [...]]]></description>
			<content:encoded><![CDATA[<p>Almost from the day our NetApp (ThayerFS) was powered up, we&#8217;ve been on the hunt for a cheaper file server alternative.  We&#8217;re very pleased with the functionality of the NetApp, but dollar per gigabyte and several seemingly arbitrary limitations left us curious of alternative options.</p>
<p>The cost of our NetApp storage is a little fuzzy.  Over the years, we&#8217;ve come up with calculations from around 2.50 &#8211; 12 dollars per gigabyte.  But if you head over to NewEgg and buy a stand-alone 2 terabyte enterprise SATA drive, it is only $0.16 per raw gigabyte.  Obviously there is a fair amount of additional hardware and software required to make that drive available on the network.  So the question is, how can cheap can make a file server that still has respectable features, availability, and performance.</p>
<p>We&#8217;ve planned a system that comes out to about $0.28/GB raw and $0.37/GB usable.  If the system works well, our plan is to buy a near identical system that we&#8217;ll use for backup.  So backed up, the cost will be double, or about $0.84 per usable gigabyte.  While this is many times more than the cost of a raw drive, it is significantly cheaper than the alternatives that we explored.</p>
<p>Over the next few blog posts, I&#8217;ll give some details of the options we investigated, what we&#8217;ve ultimately decided to go with, and how the implementation actually works out.</p>
<p><strong>Update:</strong> The original price per GB I used was assuming we would buy a spare part of almost every electronic component (motherboard, CPU, RAM, etc.).  I&#8217;ve updated it to reflect the price without any spares parts, which is still slightly misleading, as we will buy some spare parts.  The price is also based on a fully populated system.  However, we will initially only have a system that is 25% populated.  At the beginning, this pushes the $/GB up, but because $/GB of hard drives is going down over time, by the time the system is fully populated, it will actually be lower than above quoted price.</p>
<p><a href="http://computing.thayer.dartmouth.edu/blog/2009/08/26/cheap-storage-server-part-2/">Go to part two of the Cheap Storage Project series.</a></p>
]]></content:encoded>
			<wfw:commentRss>http://computing.thayer.dartmouth.edu/blog/2009/08/25/cheap-storage-project-1/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>ThayerFS quota bump</title>
		<link>http://computing.thayer.dartmouth.edu/blog/2009/02/17/thayerfs-quota-bump/</link>
		<comments>http://computing.thayer.dartmouth.edu/blog/2009/02/17/thayerfs-quota-bump/#comments</comments>
		<pubDate>Tue, 17 Feb 2009 18:19:25 +0000</pubDate>
		<dc:creator>jared</dc:creator>
				<category><![CDATA[File Storage]]></category>

		<guid isPermaLink="false">http://computing.thayer.dartmouth.edu/?p=225</guid>
		<description><![CDATA[We&#8217;ve bumped the default ThayerFS disk quota from 512 MB up to 1 GB.  We also doubled the, &#8220;no questions asked&#8221;, maximum quota for undergraduates to 2 GB.
The &#8220;no questions asked&#8221; maximum quotas for graduate students remains at 5GB and 10GB for faculty and staff.  As always, if you have reasonable case for [...]]]></description>
			<content:encoded><![CDATA[<p>We&#8217;ve bumped the default ThayerFS disk quota from 512 MB up to 1 GB.  We also doubled the, &#8220;no questions asked&#8221;, maximum quota for undergraduates to 2 GB.</p>
<p>The &#8220;no questions asked&#8221; maximum quotas for graduate students remains at 5GB and 10GB for faculty and staff.  As always, if you have reasonable case for needing more space, just let us know.</p>
<p>Full details about ThayerFS quotas are <a href="https://wiki.thayer.dartmouth.edu/display/computing/ThayerFS+Quotas">here</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://computing.thayer.dartmouth.edu/blog/2009/02/17/thayerfs-quota-bump/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Thayer Computing Projects</title>
		<link>http://computing.thayer.dartmouth.edu/blog/2008/04/03/thayer-computing-projects/</link>
		<comments>http://computing.thayer.dartmouth.edu/blog/2008/04/03/thayer-computing-projects/#comments</comments>
		<pubDate>Fri, 04 Apr 2008 01:29:15 +0000</pubDate>
		<dc:creator>jared</dc:creator>
				<category><![CDATA[Backups]]></category>
		<category><![CDATA[Energy]]></category>
		<category><![CDATA[File Storage]]></category>
		<category><![CDATA[Linux]]></category>
		<category><![CDATA[News and Announcements]]></category>
		<category><![CDATA[Printing]]></category>
		<category><![CDATA[Servers]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[Video]]></category>
		<category><![CDATA[Windows]]></category>

		<guid isPermaLink="false">http://computing.thayer.dartmouth.edu/blog/2008/04/03/thayer-computing-projects/</guid>
		<description><![CDATA[In between keeping all things computing running at Thayer, we always have several projects in the works. Here&#8217;s a quick overview of projects we&#8217;re currently working on or planning. In no particular order&#8230;
Email and collaboration tools
Dartmouth&#8217;s Council on Computing has constituted a task force to determine Dartmouth&#8217;s requirements for future email and collaboration tools. Thayer [...]]]></description>
			<content:encoded><![CDATA[<p>In between keeping all things computing running at Thayer, we always have several projects in the works. Here&#8217;s a quick overview of projects we&#8217;re currently working on or planning. In no particular order&#8230;</p>
<h2>Email and collaboration tools</h2>
<p>Dartmouth&#8217;s Council on Computing has constituted a task force to determine Dartmouth&#8217;s requirements for future email and collaboration tools. Thayer School&#8217;s Director of Computing Services is on the task force and seeks your thoughts on the subject.</p>
<h2>Hard drive based back up server</h2>
<p>We are a &#8220;belt and suspenders&#8221; computing staff. To reflect our paranoia of making sure all your ThayerFS data is safe, we&#8217;re adding another layer of redundancy to our data back up plan. We just started setting up a new &#8220;online backup server&#8221;.  It consists of fifteen 1 Terabyte hard drives which we&#8217;ll use to back up ThayerFS.  This system will eventually be located off-site.  We&#8217;ll continue to use our tape library which will be located in yet another off-site location.</p>
<h2>Weather station and Solar Panel monitoring</h2>
<p>We&#8217;re in the middle of a project to get a weather station installed on the roof of Murdough. We&#8217;ll be using the same system to monitor the output of the Solar Panels which are already installed on the roof. The plan is to make the data available on the web for those interested in local conditions and for research purposes.</p>
<h2>Spanos lighting improvements</h2>
<p>We have been working with our building manager and FO&amp;M to add special lights to properly illuminate presenters. This should improve the quality of our lecture capture video and should reduce the harsh shadows the current lighting causes on the speaker&#8217;s face.</p>
<h2>A/V system improvements, documentation, and repairs</h2>
<p>Collaborating with Instrument Room personnel, we have been working for some time now to make our classroom and meeting room audio visual systems more user-friendly. This includes simplification of touch screen controls or even their replacement with pushbutton controls, <a href="https://wiki.thayer.dartmouth.edu/display/computing/Projectors+and+AV">enhanced documentation about how to use the systems</a>, improved image quality in Spanos and C200, a combination white board/projector screen in M210, additional microphone options, improved/repaired audio, etc. Some of these improvements are finished, but we still have a long way to go on others.</p>
<h2>New Intel compiler available</h2>
<p>We purchased two floating licenses of Intel Fortran and C++ compilers for Linux. They are available on any of our Linux clients or compute servers. Also included with these licenses are the MKL and IPP libraries.</p>
<p>For more information about these new compilers and how to use them, please see our <a href="https://wiki.thayer.dartmouth.edu/display/computing/Linux+Services" target="_blank">Linux Services page</a>.</p>
<h2>New lab computers</h2>
<p>In the next few months, we plan to deploy new lab computers for the Linux lab in Cummings and to replace the ten oldest computers in MacLean 210.</p>
<h2>Adobe Contribute rollout soonish</h2>
<p>We&#8217;ve purchased licenses of Adobe Contribute, which will allow the Thayer Community to create and edit web pages in a simple WYSIWYG editor.  The experience is similar to a word processor. If you are interested in using this software, please contact us.</p>
<h2>Vista planning</h2>
<p>While we&#8217;re content with Windows XP, Windows Vista is an inevitability as XP support is dropped by Microsoft. We&#8217;re going to start looking at the best way to image and deploy Vista on lab, desktop, and laptop computers.</p>
<h2>Blade servers</h2>
<p>We recently took delivery of a new blade chassis and some new blade servers. The chassis can accommodate up to 16 servers.  The preliminary plan is to replace our aging babylon compute cluster with a couple of blades.  This will reduce space, electricity, cooling, and administration time, while increasing the computing power over our current cluster. We&#8217;ll have several empty slots available for faculty with research projects that need extra computer power.</p>
<h2>Condor High Throughput Computing</h2>
<p>We&#8217;re investigating the use of <a href="http://www.cs.wisc.edu/condor/" target="_blank">Condor High Throughput Computing</a> on our linux clusters. This would allow the Thayer community to submit compute jobs and have them distributed automatically to our compute clusters.</p>
<p>Currently, we are testing this on our limited-access sisyphus cluster, but hope to roll it out to an upgraded babylon cluster after the Spring term. Jobs run on either cluster will be able to use the other cluster&#8217;s CPUs if they are available.</p>
<h2>Hardy Heron is coming&#8230;</h2>
<p>The new version of Ubuntu, version 8.04 (hardy heron), is scheduled to be released at the end of April. This new release features many enhancements to the version we&#8217;re currently running on our linux clients, and is also the next of Ubuntu&#8217;s &#8220;long-term release&#8221; versions, which will get security updates for a longer period of time than their other versions.</p>
<p>We have been alpha (and now beta) testing this to ensure that any bugs related to our systems are reported and corrected prior to its official release. We plan to upgrade the Cummings 227 linux lab with the new version (and new computers, too!) after the Spring term. We will also be in touch with faculty and staff who have linux clients to schedule their upgrades.</p>
<h2>ThayerCups server upgrade</h2>
<p>We just upgraded the CUPS print server that handles print jobs from Mac and Linux clients. There should not be any change in the way you print from these platforms.  The new server is now easier to back up.</p>
<h2>Application Virtualization</h2>
<p>We are testing, &#8220;Application Virtualization&#8221;, a new technique for encapsulating applications to ease deployment to lab computers and client computers.</p>
<h2>Atrium Help Desk continues</h2>
<p>We continue to hold our &#8220;Atrium Help Desk&#8221; from 3:00-4:00 on Tuesdays and Thursdays.  Come by the atrium if you have any computing-related questions.</p>
]]></content:encoded>
			<wfw:commentRss>http://computing.thayer.dartmouth.edu/blog/2008/04/03/thayer-computing-projects/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>ThayerFS Downtime on December 15, 2007</title>
		<link>http://computing.thayer.dartmouth.edu/blog/2007/11/19/thayerfs-downtime-on-december-15-2007/</link>
		<comments>http://computing.thayer.dartmouth.edu/blog/2007/11/19/thayerfs-downtime-on-december-15-2007/#comments</comments>
		<pubDate>Mon, 19 Nov 2007 19:35:03 +0000</pubDate>
		<dc:creator>mdailey</dc:creator>
				<category><![CDATA[File Storage]]></category>
		<category><![CDATA[News and Announcements]]></category>
		<category><![CDATA[Outage]]></category>

		<guid isPermaLink="false">http://computing.thayer.dartmouth.edu/blog/2007/11/19/thayerfs-downtime-on-december-15-2007/</guid>
		<description><![CDATA[On Saturday, December 15, 2007, Thayer Computing Services will be upgrading the physical infrastructure of our ThayerFS file server. This new system will provide larger and more cost-effective storage capacity.
In order to perform this upgrade, ThayerFS will be taken offline and will be unavailable from 8:00am until 2:00pm on December 15th. Our hope is that [...]]]></description>
			<content:encoded><![CDATA[<p>On Saturday, December 15, 2007, Thayer Computing Services will be upgrading the physical infrastructure of our ThayerFS file server. This new system will provide larger and more cost-effective storage capacity.</p>
<p>In order to perform this upgrade, ThayerFS will be taken offline and will be unavailable from 8:00am until 2:00pm on December 15th. Our hope is that the upgrade will be completed in less time than this, but it may take the full six hours. Once the system is back online, we will send e-mail to the community. Note that this outage will not effect e-mail, internet, or any other network services other than ThayerFS.</p>
<p>During this upgrade, we will also be moving the data in the Research share to different physical disks. During this move, Research will be accessible, but will be read-only. This move could take up to ten hours from the time that ThayerFS is back online, but may be done sooner. We will send separate e-mail regarding the status of this move.</p>
<p>If you have any questions, please feel free to e-mail computing@thayer.dartmouth.edu.</p>
<p>Your Friendly Computing Services Team</p>
]]></content:encoded>
			<wfw:commentRss>http://computing.thayer.dartmouth.edu/blog/2007/11/19/thayerfs-downtime-on-december-15-2007/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Backup Program Chosen: AMANDA</title>
		<link>http://computing.thayer.dartmouth.edu/blog/2007/05/28/backup-program-chosen-amanda/</link>
		<comments>http://computing.thayer.dartmouth.edu/blog/2007/05/28/backup-program-chosen-amanda/#comments</comments>
		<pubDate>Mon, 28 May 2007 17:01:55 +0000</pubDate>
		<dc:creator>jordan</dc:creator>
				<category><![CDATA[Backups]]></category>
		<category><![CDATA[File Storage]]></category>

		<guid isPermaLink="false">http://computing.thayer.dartmouth.edu/blog/2007/05/28/backup-program-chosen-amanda/</guid>
		<description><![CDATA[After performance and feature testing AMANDA and Bacula, we&#8217;ve decided to go with AMANDA. The speed that AMANDA was able to backup data over the network was nearly twice that of Bacula (80 MB/s compared to 40 MB/s), and AMANDA was slightly faster pushing data to tape (77 MB/s compared to 63 MB/s). The speed [...]]]></description>
			<content:encoded><![CDATA[<p>After performance and feature testing AMANDA and Bacula, we&#8217;ve decided to go with AMANDA. The speed that AMANDA was able to backup data over the network was nearly twice that of Bacula (80 MB/s compared to 40 MB/s), and AMANDA was slightly faster pushing data to tape (77 MB/s compared to 63 MB/s). The speed advantages that AMANDA showed within our topology were significant enough to warrant it as our backup program of choice. That being said, within other topologies and with different needs, I would suggest looking closely at both programs, as both are excellent and have different strenghts. </p>
]]></content:encoded>
			<wfw:commentRss>http://computing.thayer.dartmouth.edu/blog/2007/05/28/backup-program-chosen-amanda/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>AMANDA and Bacula: which backup program to choose?</title>
		<link>http://computing.thayer.dartmouth.edu/blog/2007/05/15/amanda-and-bacula-which-backup-program-to-choose/</link>
		<comments>http://computing.thayer.dartmouth.edu/blog/2007/05/15/amanda-and-bacula-which-backup-program-to-choose/#comments</comments>
		<pubDate>Wed, 16 May 2007 00:25:40 +0000</pubDate>
		<dc:creator>jordan</dc:creator>
				<category><![CDATA[Backups]]></category>
		<category><![CDATA[File Storage]]></category>

		<guid isPermaLink="false">http://computing.thayer.dartmouth.edu/blog/index.php/2007/05/15/amanda-and-bacula-which-backup-program-to-choose/</guid>
		<description><![CDATA[An important part of any enterprise level storage system is the backup system. For our new inexpensive storage system, we want to find a piece of backup software that is both inexpensive (preferably free) as well as functional. My search for such a program has led to two main candidates: AMANDA and Bacula. I&#8217;ll start [...]]]></description>
			<content:encoded><![CDATA[<p>An important part of any enterprise level storage system is the backup system. For our new inexpensive storage system, we want to find a piece of backup software that is both inexpensive (preferably <a href="http://www.fsf.org/">free</a>) as well as functional. My search for such a program has led to two main candidates: <a href="http://www.amanda.org">AMANDA</a> and <a href="http://www.bacula.org">Bacula</a>. I&#8217;ll start with AMANDA as that is where my testing began.</p>
<p>AMANDA approaches backups by using native programs such as postfix, tar, dump, xfsdump, etc within a client/server relationship. The catalog database is store in flat tar.gz files instead of a more complicated (but perhaps more scalable) database such as MySQL. AMANDA&#8217;s claim to fame is its scheduler. You tell AMANDA you backup cycle (say 30 days), the set of clients/directories/files you want to backup, and she takes it from there, automatically balancing backups (both full and incremental) throughout that time period. While this is great for load balancing, as I understand it, the administrator is only guaranteed that backups will happen during the cycle, not exactly when those backups will happen. AMANDA is also very fast. In my testing (two Dell PE2850 machines running <a href="http://www.ubuntu.com">Ubuntu</a> over Gig-E networking with an IBM LTO-3 tape drive) the backup averaged ~50 MB/s between the client (to the holding disk) and then ~77 MB/s from holding disk to tape. Where AMANDA shows her age is dealing with tape libraries. The only way I have found to handle a tape library with more than one drive (ours has four) is to create separate AMANDA configurations, each with its own tape drive and set of slots in the library. This negates much of the smart scheduling AMANDA provides, as you have to manually balance which directories/file to backup in each AMANDA configuration. Another gotcha with AMANDA is the inability to append data to tapes. Overall, AMANDA is simple and fast, but makes coping with a large tape library somewhat cumbersome. </p>
<p>Bacula is a holistic backup solution. Everything needed to use Bacula is included with Bacula. Bacula is implemented with a client/server relationship, using its own protocols and your choice of MySQL, PostgeSQL, or SQLite to store the catalog in. Scheduling is done in the standard fashion of telling the program what days/weeks/months you want fulls and incrementals to occur. Bacula handles tape libraries very well, allowing multiple tape drives to co-exist in a single configuration. Performance is adequate. Using the same hardware and OS as in the AMANDA tests. The same backup (~90 GB) that was used in the AMANDA tests averaged ~30 MB/s from client to holding disk, and then ~63 MB/s from holding disk to tape. Overall Bacula provides a slick program, great documentation and support for advanced features, at the price of some speed. </p>
<p>In conclusion, AMANDA and Bacula are both excellent programs, with different strengths. From my research, the difference in network speed between AMANDA and Bacula is due to Bacula pumping everything through one TCP port, while AMANDA fires up several dumpers, using many TCP ports to transfer the data. When backup up more than one client, the aggregate speed Bacula provides increases. </p>
]]></content:encoded>
			<wfw:commentRss>http://computing.thayer.dartmouth.edu/blog/2007/05/15/amanda-and-bacula-which-backup-program-to-choose/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
