<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Boxes and Arrows &#187; Findability</title>
	<atom:link href="http://boxesandarrows.com/category/findability/feed/" rel="self" type="application/rss+xml" />
	<link>http://boxesandarrows.com</link>
	<description>Boxes and Arrows is devoted to the practice, innovation, and discussion of design; including graphic design, interaction design, information architecture and the design of business.</description>
	<lastBuildDate>Tue, 21 May 2013 13:57:51 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	
		<item>
		<title>Tree Testing</title>
		<link>http://boxesandarrows.com/tree-testing/</link>
		<comments>http://boxesandarrows.com/tree-testing/#comments</comments>
		<pubDate>Sat, 05 Dec 2009 08:02:46 +0000</pubDate>
		<dc:creator>Dave OBrien</dc:creator>
				<category><![CDATA[Discovery, Research, and Testing]]></category>
		<category><![CDATA[Findability]]></category>
		<category><![CDATA[Methods]]></category>
		<category><![CDATA[Software and Tools]]></category>

		<guid isPermaLink="false">http://boxesandarrows.com/tree-testing/</guid>
		<description><![CDATA[After hearing about Donna Spencer's paper-based tree testing, Dave O'Brien and his colleagues were hooked, so much so that they built an online tool to allow you to effectively test a site hierarchy. Hear about the tree testing approach and see how O'Brien's tool works.]]></description>
				<content:encoded><![CDATA[<p>A big part of information architecture is <b>organisation</b> – creating the structure of a site. For most sites – particularly large ones – this means creating a hierarchical “tree” of topics.</p>
<p>But to date, the IA community hasn’t found an effective, simple technique (or tool) to <b>test site structures</b>. The most common method used &#8212; closed card sorting &#8212; is neither widespread nor particularly suited to this task.</p>
<p>Some years ago, Donna Spencer pioneered a simple paper-based technique to test trees of topics. Recent refinements to that method, some made possible by online experimentation, have now made “tree testing” more effective and agile.<br />
<br />
<h3>How it all began</h3>
<p>Some time ago, we were working on an information-architecture project for a large government client here in New Zealand. It was a classic IA situation – their current site’s structure (the hierarchical “tree” of topics) was a mess, they knew they had outgrown it, and they wanted to start fresh.</p>
<p>We jumped in and did some research, including card-sorting exercises with various user groups. We’ve always found card sorts (in person or online) to be a great way to generate ideas for a new IA.</p>
<p>Brainstorming sessions followed, and we worked with the client to come up with several possible new site trees. But were they better than the old one? And which new one was best? After a certain amount of debate, it became clear that debate wasn’t the way to decide. <b>We needed some real data – data from users.</b> And, like all projects, we needed it <b>quickly</b>.</p>
<p>What kind of data? At this early stage, we weren’t concerned with visual design or navigation methods; we <b>just wanted to test organisation – specifically, findability and labeling</b>. We wanted to know:<br />
* Could users successfully find particular items in the tree?<br />
* Could they find those items directly, without having to backtrack?<br />
* Could they choose between topics quickly, without having to think too much (the Krug Test)<sup><a href="#fn1">1</a></sup>?<br />
* Overall, which parts of the tree worked well, and which fell down?</p>
<p>Not only did we want to test each proposed tree, we wanted to <b>test them against each other</b>, so we could pick the best ideas from each.</p>
<p>And finally, we needed to <b>test the proposed trees against the existing tree</b>. After all, we hadn’t just contracted to deliver a different IA – we had promised a better IA, and we needed a quantifiable way to prove it.<br />
</p>
<h3>The problem</h3>
<p>This, then, was our IA challenge:<br />
* getting <b>objective data</b> on the <b>relative effectiveness</b> of several tree structures<br />
* getting it done <b>quickly</b>, without having to build the actual site first.</p>
<p>As mentioned earlier, we had already used <b>open card sorting</b> to generate ideas for the new site structure. We had done in-person sorts (to get some of the “why” behind our users’ mental models) as well as online sorts (to get a larger sample from a wider range of users).</p>
<p>But while open card sorting is a good “detective” technique, it doesn’t yield the final site structure &#8211; it just provides clues and ideas. And it certainly doesn’t help in evaluating structures.</p>
<p>For that, information architects have traditionally turned to <b>closed card sorting</b>, where the user is provided with predefined category “buckets” and ask to sort a pile of content cards into those buckets. The thinking goes that if there is general agreement about which cards go in which buckets, then the buckets (the categories) should perform well in the delivered IA.</p>
<p>The problem here is that, while closed card sorting mimics how users may file a particular item of content (e.g. where they might store a new document in a document-management system), it doesn’t necessarily model how users find information in a site. They don’t start with a document &#8212; <b>they start with a task</b>, just as they do in a usability test.</p>
<p>What we wanted was <b>a technique that more closely simulates how users browse</b> sites when looking for something specific. Yes, closed card sorting was better than nothing, but it just didn’t feel like the right approach.</p>
<p>Other information architects have grappled with this same problem. We know some who wait until they are far enough along in the wireframing process that they can include some IA testing in the first rounds of usability testing. That piggybacking saves effort, but it also means that we don’t get to evaluate the IA until later in the design process, which means more risk.</p>
<p>We know others who have thrown together quick-and-dirty HTML with a proposed site structure and placeholder content. This lets them run early usability tests that focus on how easily participants can find various sublevels of the site. While that gets results sooner, it also means creating a throw-away set of pages and running an extra round of user testing.</p>
<p>With these needs in mind, we looked for a new technique – one that could:<br />
* Test topic trees for effective organisation<br />
* Provide a way to compare alternative trees<br />
* Be set up and run with minimal time and effort<br />
* Give clear results that could be acted on quickly</p>
<p>
<h3>The technique &#8212; tree testing</h3>
<p>Luckily, the technique we were looking for already existed. Even luckier was that we got to hear about it firsthand from its inventor, Donna Spencer, the well-regarded information architect out of Australia, and author of the recently released book &#8220;Card Sorting&#8221;:http://rosenfeldmedia.com/books/cardsorting/.</p>
<p>During an IA course that Donna was teaching, she was asked how she tested the site structures she created for clients. She mentioned closed card sorting, but like us, she wasn’t satisfied with it.</p>
<p>She then went on to describe a technique she called &#8220;card-based classification&#8221;:http://www.boxesandarrows.com/view/card_based_classification_evaluation, which she had used on some of her IA projects. Basically, it involved modeling the site structure on index cards, then giving participants a “find-it” task and asking them to navigate through the index cards until they found what they were looking for.</p>
<p>To test a shopping site, for example, she might give them a task like “Your 9-year-old son asks for a new belt with a cowboy buckle”. She would then show them an index card with the top-level categories of the site:</p>
<p>
<img src="http://www-boxesandarrows-com.zippykid.netdna-cdn.com/files/banda/tree-testing/card1.top-levelcategories.jpg" width="252" height="165" alt="She would then show them an index card with the top-level categories of the site." title="She would then show them an index card with the top-level categories of the site."/></p>
<p>
The participant would choose a topic from that card, leading to another index card with the subtopics under that topic.</p>
<p>
<img src="http://www-boxesandarrows-com.zippykid.netdna-cdn.com/files/banda/tree-testing/card2.selectcategory.jpg" width="416" height="217" alt=" The participant would choose a topic from that card, leading to another index card with the subtopics under that topic." title=" The participant would choose a topic from that card, leading to another index card with the subtopics under that topic."/></p>
<p>
The participant would continue choosing topics, moving down the tree, until they found their answer. If they didn’t find a topic that satisfied them, they could backtrack (go back up one or more levels). If they still couldn’t find what they were looking for, they could give up and move on to the next task.</p>
<p>During the task, the moderator would record:<br />
* the path taken through the tree (using the reference numbers on the cards)<br />
* whether the participant found the correct topic<br />
* where the participant hesitated or backtracked</p>
<p>By choosing a small number of representative tasks to try on participants, Donna found that she could quickly determine which parts of the tree performed well and which were letting the side down. And she could do this without building the site itself – all that was needed was a textual structure, some tasks, and a bunch of index cards.</p>
<p>Donna was careful to point out that this technique <b>only tests the top-down organisation of a site and the labeling of its topics</b>. It does not try to include other factors that affect findability, such as:<br />
* the visual design and layout of the site<br />
* other navigation routes (e.g. cross links)<br />
* search</p>
<p>While it’s true that this technique does not measure everything that determines a site’s ease of browsing, that can also be a strength. By <b>isolating the site structure</b> &#8211; by removing other variables at this early stage of design &#8211; we can more clearly see how the tree itself performs, and revise until we have a solid structure. We can then move on in the design process with confidence. It’s like unit-testing a site’s organisation and labeling. Or as my colleague Sam Ng says, “Think of it as analytics for a website you haven’t built yet.”<br />
<br />
<h3>So we built Treejack</h3>
<p>As we started experimenting with “card-based classification” on paper, it became clear that, <b>while the technique was simple, it was tedious</b> to create the cards on paper, recruit participants, record the results manually, and enter the data into a spreadsheet for analysis. The steps were easy enough, but they were time eaters.</p>
<p>It didn’t take too much to <b>imagine all this turned into a web app</b> – both for the information architect running the study and the participant browsing the tree. Card sorting had gone online with good results, so why not card-based classification?</p>
<p>Ah yes, that was the other thing that needed work – the name. During the paper exercises, it got called “tree testing”, and because that seemed to stick with participants and clients, it stuck with us. And it sure is a lot easier to type.</p>
<p>To create a good web app, we knew we had to be absolutely clear about what it was supposed to do. For online tree testing, we aimed for something that was:<br />
* Quick for an information architect to learn and get going on<br />
* Simple for participants to do the test<br />
* Able to handle a large sample of users<br />
* Able to present clear results</p>
<p>We created a rudimentary application as a proof of concept, running a few client pilots to see how well tree testing worked online. After working with the results in Excel, <b>it became very clear which parts of the trees were failing users, and how they were failing.</b> The technique worked.</p>
<p>However, it also became obvious that a wall of spreadsheet data did not qualify as “clear results”. So when we sat down to design the next version of the tool – the version that information architects could use to run their own tree tests – reworking the results was our number-one priority.</p>
<p></p>
<h3>Participating in an online tree test</h3>
<p>So, what does online tree testing look like? Let’s look at what a participant sees.</p>
<p>Suppose we’ve emailed an invitation to a list of possible participants. (We recommend at least 30 to get reasonable results – more is good, especially if you have different types of users.) Clicking a link in that email takes them to the Treejack site, where they’re welcomed and instructed in what to do.</p>
<p>Once they start the test, they’ll see a task to perform. The tree is presented as a simple list of top-level topics:<br />
<img src="http://www-boxesandarrows-com.zippykid.netdna-cdn.com/files/banda/tree-testing/treejack1.top-levelcategories.jpg" width="245" height="249" alt="In Treejack, the tree is presented as a simple list of top-level topics." title="In Treejack, the tree is presented as a simple list of top-level topics."/></p>
<p>They click down the tree one topic at a time. Each click shows them the next level of the tree:<br />
<img src="http://www-boxesandarrows-com.zippykid.netdna-cdn.com/files/banda/tree-testing/treejack2.secondleveltopics.jpg" width="249" height="261" alt="In Treejack, each click shows them the next level of the tree." title="In Treejack, each click shows them the next level of the tree."/></p>
<p>Once they click to the end of a branch, they have 3 choices:<br />
* Choose the current topic as their answer (“I’d find it here”).<br />
* Go back up the tree and try a different path (by clicking a higher-level topic).<br />
* Give up on this task and move to the next one (“Skip this task”).</p>
<p><img src="http://www-boxesandarrows-com.zippykid.netdna-cdn.com/files/banda/tree-testing/treejack3.selectinganswer.jpg" width="259" height="263" alt="In Treejack, the participant selects an answer." title="In Treejack, the participant selects an answer."/></p>
<p>Once they’ve finished all the tasks, they’re done – that’s it. For a typical test of 10 tasks on a medium-sized tree, most participants take 5-10 minutes. As a bonus, we’ve found that participants usually find tree tests less taxing than card sorts, so we get lower drop-out rates.<br />
<br />
<h3>Creating a tree test</h3>
<p>The heart of a tree test is…um…the tree, modeled as a list of text topics.</p>
<p>One lesson that we learned early was to build the tree based on the content of the site, not simply its page structure. Any implicit in-page content should be turned into explicit topics in the tree, so that participants can “see” and select those topics.</p>
<p>Also, because we want to measure the effectiveness of the site’s topic structure, we typically omit “helper” topics such as Search, Site Map, Help, and Contact Us. If we leave them in, it makes it too easy for users to choose them as alternatives to browsing the tree.<br />
<br />
<h3>Devising tasks</h3>
<p>We test the tree by getting participants to look for specific things – to perform “find it” tasks. Just as in a usability test, a good task is clear, specific, and representative of the tasks that actual users will do on the real site.</p>
<p>How many tasks? You might think that more is better, but we’ve found a sizable learning effect in tree tests. After a participant has browsed through the tree several times looking for various items, they start to remember where things are, and that can skew later tasks. For that reason, we recommend about 10 tasks per test, presented in a random sequence.</p>
<p>Finally, for each task, we select the correct answers – 1 or more tree topics that satisfy that task.<br />
</p>
<h3>The results</h3>
<p>So we’ve run a tree test. How did the tree fare?</p>
<p>At a high level, we look at:<br />
* <b>Success</b> &#8211; % of participants who found the correct answer. This is the single most important metric, and is weighted highest in the overall score.<br />
* <b>Speed</b> – how fast participants clicked through the tree. In general, confident choices are made quickly (i.e. a high Speed score), while hesitation suggests that the topics are either not clear enough or not distinguishable enough.<br />
* <b>Directness</b> – how directly participants made it to the answer. Ideally, they reach their destination without wandering or backtracking.</p>
<p>For each task, we see a percentage score on each of these measures, along with an aggregate score (out of 10):<br />
<img src="http://www-boxesandarrows-com.zippykid.netdna-cdn.com/files/banda/tree-testing/treejack4.resultsview.jpg" width="416" height="163" alt="Showing Treejack results with a percentage score of each measure and an aggregate score." title="Showing Treejack results with a percentage score of each measure and an aggregate score."/></p>
<p>If we see an overall score of 8/10 for the entire test, we’ve earned ourselves a beer. Often, though, we’ll find ourselves looking at a 5 or 6, and realise that there’s more work to be done.</p>
<p>The good news is that our miserable overall score of 5/10 is often some 8’s and 9’s brought down by a few 2’s and 3’s. This is where tree testing really shines &#8212; <b>separating the good parts of the tree from the bad</b>, so we can spend our time and effort fixing the latter.</p>
<p>To do more detailed analysis on the low scores, we can download the data as a spreadsheet, showing destinations for each task, first clicks, full click paths, and so on.</p>
<p>In general, we’ve found that tree-testing results are <b>much easier to analyse</b> than card-sorting results. The high-level results pinpoint where the problems are, and the detailed results usually make the reason plain. In cases where a result has us scratching our heads, we do a few in-person tree tests, prompting the participant to think aloud and asking them about the reasons behind their choices.<br />
</p>
<h3>Lessons learned</h3>
<p>We’ve run several tree tests now for large clients, and we’re very pleased with the technique. Along the way, we’ve learned a few things too:<br />
* <b>Test a few different alternatives.</b> Because tree tests are quick to do, we can take several proposed structures and test them against each other. This is a quick way of resolving opinion-based debates over which is better. For the government web project we discussed earlier, one proposed structure had much lower success rates than the others, so we were able to discard it without regrets or doubts.
<p />
* <b>Test new against old.</b> Remember how we promised that government agency that we would deliver a better IA, not just a different one? Tree testing proved to be a great way to demonstrate this. In our baseline test, the original structure notched a 31% success rate. Using the same tasks, the new structure scored 67% &#8211; a solid quantitative improvement.
<p />
* <b>Do iterations.</b> Everyone talks about developing designs iteratively, but schedules and budgets often quash that ideal. Tree testing, on the other hand, has proved quick enough that we’ve been able to do two or three revision cycles for a given tree, using each set of results to progressively tweak and improve it.
<p />
* <b>Identify critical areas to test, and tailor your tasks to exercise them.</b> Normally we try to cover all parts of the tree with our tasks. If, however, there are certain sections that are especially critical, it’s a good idea to run more tasks that involve those sections. That can reveal subtleties that you may have missed with a “vanilla” test. For example, in another study we did, the client was considering renaming an important top-level section, but was worried that the new term (while more accurate) was less clear. Tree testing showed both terms to be equally effective, so the client was free to choose based on other criteria.
<p />
* <b>Crack the toughest nuts with “live” testing.</b> Online tree tests suffer from the same basic limitation as most other online studies – they give us loads of useful data, but not always the “why” behind it. Moderated testing (either in person or by remote session) can fill in this gap when it occurs.</p>
<p>
<h3>Conclusion</h3>
<p>Tree testing has given us the IA method we were after – a quick, clear, quantitative way to test site structures. Like user testing, it shows us (and our clients) where we need to focus our efforts, and injects some user-based data into our IA design process. The simplicity of the technique lets us do variations and iterations until we get a really good result.</p>
<p>Tree testing also makes our clients happy. They quickly “get” the concept, the high-level results are easy for them to understand, and they love having data to show their management and to measure their progress against.
<p id="fn1"><sup></sup></p>
<p id="fn2"><sup></sup></p>
<p><i>You can sign up for a free Treejack account at &#8220;Optimal Workshop&#8221;:http://www.optimalworkshop.com/treejack.htm.<sup><a href="#fn1">2</a></sup></i><br />
</p>
<h2>References</h2>
<p>1. &#8220;Don’t Make Me Think&#8221;:http://www.amazon.com/Dont-Make-Me-Think-Usability/dp/0321344758, Steve Krug<br />
2. Full disclosure: As noted in his &#8220;bio&#8221;:http://boxesandarrows.com/person/35384-daveobrien, O&#8217;Brien works with Optimal Workshop.</p>
]]></content:encoded>
			<wfw:commentRss>http://boxesandarrows.com/tree-testing/feed/</wfw:commentRss>
		<slash:comments>19</slash:comments>
		</item>
		<item>
		<title>People Finder: Searching Without Logic?</title>
		<link>http://boxesandarrows.com/people-finder-searching-without-logic/</link>
		<comments>http://boxesandarrows.com/people-finder-searching-without-logic/#comments</comments>
		<pubDate>Thu, 14 Aug 2008 07:53:04 +0000</pubDate>
		<dc:creator>Vivek Deshmukh</dc:creator>
				<category><![CDATA[Findability]]></category>
		<category><![CDATA[Interfaces]]></category>
		<category><![CDATA[Methods]]></category>
		<category><![CDATA[Special topic: Intranets]]></category>
		<category><![CDATA[Special topic: Search and Metadata]]></category>

		<guid isPermaLink="false">http://boxesandarrows.com/people-finder-searching-without-logic/</guid>
		<description><![CDATA[In large organizations, finding<br /> people is a very common intranet task. Vivek Deshmukh gives us advice on how to improve people search and really help staff find<br /> one another.]]></description>
				<content:encoded><![CDATA[<p>One of the most frequent tasks on many intranets is finding people within the company. Providing an effective way to search people is thus a key goal in designing intranets. This goal becomes even more important for an organization like Emirates, a leading international airline, which has over 35,000 employees with over 140 nationalities and where more people are likely to use this feature more frequently.</p>
<p>Our intranet provides many applications that have a people finder feature to help staff find each other. The goal in using this feature varies depending on the application and situation. For example, people may want to find a staff to book a meeting or add them to a project team. Whatever the goal, a simple text input field and a Find button are enough to provide the sought-after results. But again and again I have heard complaints about not being able to effectively find colleagues using this feature.</p>
<p>The effectiveness of the People Finder feature is challenged in the following ways:
<ol>
<li>People misspell names of staff they are searching. (e.g., &lsquo;Vivek&rsquo; is spelled as &lsquo;Vevek&rsquo;; with over 140 different nationalities this is bound to happen.)</li>
<li>Names stored in the database are not in proper format. (e.g., &lsquo;Vivek Deshmukh&rsquo; is stored as &lsquo;Vivek D.&rsquo;)</li>
<li>People are known by completely different names than the one stored in the database. (e.g., In some cultures women change their names after marriage.)</li>
</ol>
<p><img title="" height="161" alt="" width="415" src="http://www-boxesandarrows-com.zippykid.netdna-cdn.com/files/banda/people-finder/People_Finder_Figure_1.gif" /><br />
<i>Figure 1: A typical example of not finding a person in staff directory</i><br />

<p>How can you design a better People Finder application than the one that so often says &ldquo;Staff not found!&rdquo;?</p>
<p/>
<h2>Building on users efforts</h2>
<p>One idea is to look at what users do with the problem at hand and how they solve it, and then use their efforts to build the application. For our People Finder application we can do this by having a &ldquo;Did you mean &hellip;&rdquo; feature, which gives alternative name suggestions to users. These suggestions are not built by pre-defined logic but are based on the collective input of users.</p>
<p>In our department, when colleagues don&rsquo;t find someone on a People Finder application, they try various strategies. These include:</p>
<ol>
<li>trying different spellings,</li>
<li>asking another colleague for the persons correct name and spelling,</li>
<li>calling the person directly (if they have their phone number), and</li>
<li>checking previous emails to get the exact spelling.</li>
</ol>
<p>Whatever activity they choose, they make sure that they have the right information to type in the People Finder text box. We need to make use of this effort (i.e., making an error and then fixing it) from the users to build our application. The following conceptual model is my attempt at designing such a system.
</p>
<p></p>
<h2>Building the application</h2>
<p>There are five essential components to this concept:</p>
<ol>
<li>Build a relation table to store incorrect entries. In other words, store search queries which produced no results.</li>
<li>Determine if the user has found the right person.</li>
<li>Build a relation between the previous incorrect entries with the last correct entry determined in step 2.</li>
<li>Check the strength of relation by observing patterns across all users.</li>
<li>Present strong patterns as a &ldquo;Did you mean &#8230;&rdquo; feature on the search results page.</li>
</ol>
<p>Let&rsquo;s look at each step in detail.<br />

<p><b>STEP 1: Build a relation table to store incorrect entries. </b></p>
<p>To explain the concept, let&rsquo;s take the scenario in which a user Sally wants to organize a meeting with Timothy Campbell using People Finder but cannot find him because Timothy Campbell is stored as Tim C. in the application database. (See Figure 1 above.) Let us store this incorrect entry Timothy Campbell in a database table called &ldquo;Relation Database Table for Sally&rdquo; (See Figure 2).</p>
<p><img title="" height="96" alt="" width="396" src="http://www-boxesandarrows-com.zippykid.netdna-cdn.com/files/banda/people-finder/People_Finder_Figure_2.gif" /><br />
<i>Figure 2: Incorrect entry inserted in the Relation table for Sally</i></p>
<p>
<p><b>STEP 2: Determine if the user has found the right person. </b></p>
<p>Next, let us say Sally tries a few more names in the People Finder text box, which generate no results. We store all of these incorrect entries in the Relation table. After a few failed attempts, Sally asks her colleague how to find Timothy Campbell in the address book. She is told his name appears in the address book as Tim C. Sally types the name &lsquo;Tim C.&rsquo; and gets a result with Tim C.&rsquo;s details. Sally adds Tim C. to the meeting list. It is this action of Sally clicking the Add button that allows us to identify a correct entry for the Relation table. (See Figure 3.)</p>
<p><img title="" height="153" alt="" width="402" src="http://www-boxesandarrows-com.zippykid.netdna-cdn.com/files/banda/people-finder/People_Finder_Figure_5.gif" /><br />
<i>Figure 3: Sally now types Tim Campbell&rsquo;s name as it appears in the database.</i></p>
<p>
<p><b>STEP 3: Build a relation between the previous incorrect entries with the correct entry. </b></p>
<p>We then build a relation between the previous incorrect entries with the first following correct entry (i.e., Tim C.) and add it to another table called &lsquo;Alias&rsquo; for the staff Tim C. Think of the Alias table as a &lsquo;People also know Tim C. as &hellip;&rsquo; list. Note that the basis for saying that there exists a relation between the incorrect entries and the correct entry is the real life observation that people do what they must to find the correct name to type in the search text box. Of course you may get mismatches but this will be taken care of in the next steps.</p>
<p><img title="" height="217" alt="" width="427" src="http://www-boxesandarrows-com.zippykid.netdna-cdn.com/files/banda/people-finder/People_Finder_Figure_4.gif" /><br />
<i>Figure 4: Tim C. is related with the previously typed names</i></p>
<p><img title="" height="153" alt="" width="402" src="http://www-boxesandarrows-com.zippykid.netdna-cdn.com/files/banda/people-finder/People_Finder_Figure_5.gif" /><br />
<i>Figure 5: Alias table for Tim C.</i></p>
<p>
<p><b>STEP 4: Check the strength of relation by observing patterns across all users.</b></p>
<p>Next we identify the most common aliases used for finding Tim C. We do this by looking at the Alias table for Tim C. Those aliases that appear frequently are strong candidates to be displayed with a &ldquo;Did you mean &hellip;&rdquo; feature. In our example Timothy Campbell and Tim Campbell show a good pattern across different users as aliases for Tim C., so we conclude that when people search for Tim Campbell they mean Tim C.</p>
<p><img title="" height="393" alt="" width="657" src="http://www-boxesandarrows-com.zippykid.netdna-cdn.com/files/banda/people-finder/People_Finder_Figure_6.gif" /><br />
<i>Figure 6: Alias table shows that lot of people type Timothy Campbell or Tim Campbell to find Tim C.</i></p>
<p>
<p><b>STEP 5: Present common patterns as &ldquo;Did you mean &#8230;&rdquo; feature on the search results page. </b></p>
<p>The last step is to present the most common pattern to the users as a &ldquo;Did you mean &hellip;&rdquo; feature. In our example when -users search Timothy Campbell we present them with Tim C. as a &ldquo;Did you mean &hellip;&rdquo; feature. We can show additional information like the department, title or a photo of Tim Campbell so that the user can confirm that it&rsquo;s the person he is looking for.</p>
<p><img title="" height="296" alt="" width="672" src="http://www-boxesandarrows-com.zippykid.netdna-cdn.com/files/banda/people-finder/People_Finder_Figure_7.gif" /><br />
<i>Figure 7: Implementation of the &ldquo;Did you mean &hellip;&rdquo; feature</i></p>
<p></p>
<h2>Making the system efficient</h2>
<p>The secret to making our system more efficient is eliminating -irrelevant relations. Consider this question: Should you build a relation between an incorrect entry which was entered at 08.30 and the next correct entry entered at 09.20? Probably not! It is very unlikely that the user will search for the same person after a gap of 50 minutes. A time frame of 20 minutes may be more realistic.</p>
<p>
<h2>Advice on how to go about building such a system</h2>
<ol>
<li><b>Build a business case </b><br />Building such a system will take time and resources. You will need to present an argument to management why this is important and perhaps make a business case for the effort. Don&rsquo;t forget to include key stakeholders like Human Resources while presenting the business case. Here are some key points:
<ul>
<li>Users will save valuable time while searching other staff.</li>
<li>Colleagues will not be disturbed &ndash; their time will be saved.</li>
<li>Companies will save on phone bills and employee time.</li>
<li>Systems become robust over time without additional work from users or a massive data cleaning effort.</li>
</ul>
</li>
<li><b>Collaborate and co-ordinate with different IT teams who build applications </b><br />Pepare a list of all applications that use the People Finder feature. Collaborate with the IT teams who are responsible to build these applications and work out a plan to implement the &ldquo;Did you mean &hellip;&rdquo; feature on the current applications. This task becomes easier if you have a centralized IT team.</li>
<p>
<li><b>Prioritize applications for implementation </b><br />Our system becomes robust when many users use the system. Start implementing the feature on the &ldquo;Frequently used by many&rdquo; type of applications first. These applications will give maximum value in shortest amount of time.</li>
</ol>
<h2>The risks</h2>
<p>There is a risk that few users can work together to build a strong pattern of, say, &ldquo;Jerk&rdquo; with Rob Stevenson, thus manipulating the system. This can be kept in check by doing two things. If your company is like ours (formal and very particular about its image) you can:</p>
<ol>
<li>Keep pattern strength high especially if the suggestions to users are going to be automatic without human intervention.</li>
<li>In addition to high pattern strength you can include a manual check done by HR admin who can authorize or investigate each strong pattern. To do this you will need to provide an admin interface to HR where they can monitor and dig deeper in to strong aliases.</li>
</ol>
<h2>Going forward</h2>
<p>Just as users learn a new system by using it, possibly by making mistakes on the way, a system can also be &rdquo;trained&quot; to learn from the users by continuously &ldquo;listening&rdquo; to users inputs, while helping users along the way. Though some development effort and technical know-how is required, more intelligent people-finding features on company intranets are essential. There is a long-term payoff, and companies will be saving a great deal in terms of employee time and costs.</p>
]]></content:encoded>
			<wfw:commentRss>http://boxesandarrows.com/people-finder-searching-without-logic/feed/</wfw:commentRss>
		<slash:comments>10</slash:comments>
		</item>
		<item>
		<title>Advancing Advanced Search</title>
		<link>http://boxesandarrows.com/advancing-advanced-search/</link>
		<comments>http://boxesandarrows.com/advancing-advanced-search/#comments</comments>
		<pubDate>Wed, 16 Jan 2008 16:29:32 +0000</pubDate>
		<dc:creator>Stephen Turbek</dc:creator>
				<category><![CDATA[Findability]]></category>
		<category><![CDATA[Interactivity]]></category>
		<category><![CDATA[Interfaces]]></category>
		<category><![CDATA[Special topic: Search and Metadata]]></category>

		<guid isPermaLink="false">http://boxesandarrows.com/advancing-advanced-search/</guid>
		<description><![CDATA[The success of the simple search box has relegated advanced search to second-class status. Stephen Turbek looks to resurrect this useful feature from the dustbins of the design toolbox and suggest some useful ways for designers to utilize it effectively.]]></description>
				<content:encoded><![CDATA[<p>Advanced search is the ugly child of interface design -always included, but never loved.  Websites have come to depend on their search engines as the volume of content has increased. Yet advanced search functionality has not significantly developed in years. Poor matches and overwhelming search results remain a problem for users. Perhaps the standard search pattern deserves a new look. A progressive disclosure approach can enable users to use precision advanced search techniques to refine their searches and pinpoint the desired results.<br />
<img width="632" height="39" src="/files/banda/advancing-advanced/yahoo-search-box-FINAL.GIF" alt="yahoo-search-box-FINAL.GIF" /><br />
<i> Fig. 1: A typical separation of standard and advanced search (Yahoo!). The design discourages use of advanced search.</i><br />
In the quest to make web sites more usable, we settled on a pattern of a clean, minimal search box with a link to an advanced search page.  Jakob Nielsen recommended, &ldquo;use an intimidating name like &lsquo;advanced search&rsquo; to scare off novice users from getting into the page and hurting themselves.&rdquo;(1)  This model has been successful. Search rivals hierarchical website navigation on many sites and is the primary means of navigation on enormous sites such as Ebay and Amazon. Advanced search, however, has not fared so well, with only a small percentage of users using it.</p>
<p>
<b><span style="font-size: larger;"> Why most people don&rsquo;t use advanced search</span></b></p>
<p>Despite its name, advanced search has not advanced very far. There is great power to conquer the overwhelming number of search results, but the current standard presents barriers to users. Specifically,</p>
<ul>
<li>The link is often small, vague, and does not describe benefits to the user</li>
<li>Advanced search pages typically have confusing page design for the few who make it there.</li>
<li>There is generally poor search revision functionality: Once a search is performed, the &ldquo;advancedness&rdquo; is lost. For example, the Google advanced search delivers the standard search results page. You have to get the query right the first time; there is no opportunity to adjust your query.</li>
</ul>
<p><span style="font-size: smaller;"><br />
</span> <img width="320" height="353" src="/files/banda/advancing-advanced/google-advanced-search-FINAL.JPG" alt="google-advanced-search-FINAL.JPG" /></p>
<p><i> Fig. 2: Google&rsquo;s advanced search is still complicated.</i></p>
<p>
<span style="font-size: larger;"><b>Who does use advanced search?</b></span></p>
<p>Tim Bray wrote, &ldquo;The people who do use Advanced Search are your most fanatical users, the professional librarians, spooks, and private investigators.&rdquo;(2) As in many situations, segmenting a population makes it harder to see whether the needs of a subgroup are shared by the majority. Current design is based on the idea that there are two separate audiences with distinct needs. The reality is that there are overlapping needs that are limited by the search pattern. <br />
<b>The essential problem of search</b> &mdash; too many irrelevant results &mdash; <b>has not gone away.</b><br />
A typical user has to make a choice between doing a search and clicking a link to do a search. In other words, do you want it now, or want to go somewhere else to look? The immediacy of the search text field and the complexity of advanced search means that users will try the text search first. <br />
Perhaps the old framing is wrong. Rather than being a matter of geeks versus normal people, the question should be whether users see a benefit to advanced search on starting. Unfortunately, there is typically no way to use advanced search at the point users realize they need it &mdash; when they haven&rsquo;t found what they were looking for and have too many search results. They have &ldquo;missed the exit&rdquo; to advanced search. Users don&rsquo;t want to lose the investment in their search; they need a way to use additional techniques to work with what they have. A new model of search can help with this problem.</p>
<p><span style="font-size: larger;"><b>Other approaches to search</b></span><br />
Let&rsquo;s take a quick look at other search innovations from the last decade.  Web data has matured and become more structured. Taxonomies and tagging are now common. There are new opportunities to deal with search results overload by filtering, as long as it is clear and easy for the audience. Despite the truism that users will not go past the first page of search results, they will use obvious tools to refine their searches.</p>
<p><b>Searching by defined parameters</b> is natural in some circumstances (for example, airline ticket searching) but the majority of sites are not sufficiently data-driven to have an interface designed around the data.</p>
<p><img width="280" height="369" src="http://www.boxesandarrows.com/files/banda/advancing-advanced/travelocity.gif" alt="image of travelocity.com flight search form" /><br />
<i>Fig. 3: travelocity.com  Flight searches, in a sense, only use advanced search.</i></p>
<p>&nbsp;</p>
<p><b>Tags</b> can improve search results by better describing what someone is seeking.</p>
<p><span style="color: rgb(255, 0, 0); font-size: 18px; font-weight: bold;" class="Apple-style-span"> </span></p>
<p style="margin: 0px; font-family: Helvetica; font-style: normal; font-variant: normal; font-weight: normal; font-size: 12px; line-height: normal; font-size-adjust: none; font-stretch: normal;"><img width="722" height="128" src="http://www.boxesandarrows.com/files/banda/advancing-advanced/amazon_tagging-FINAL.jpg" alt="amazon.com tag interface" /></p>
<p><i>Fig. 4: amazon.com&rsquo;s approach to tagging a product</i></p>
<p><b>Faceted searching</b>, or returning browsing categories in the search results (as eBay does) can be effective at prompting the user to select a single category.</p>
<p><img width="683" height="335" src="/files/banda/advancing-advanced/ebay_faceted-FINAL.GIF" alt="ebay's faceted search" /><br />
<i>Fig. 5: ebay.com&rsquo;s faceted search</i></p>
<p><span style="font-size: larger;"><b>Filtering search results</b></span></p>
<p>Amazon and Kayak offer filters to enable users to reduce many results to a few. This can be very effective, but there are obvious constraints due to the limited space available for each filter. Only the first few filters are visible when the page loads.<br />
<img width="309" height="372" src="/files/banda/advancing-advanced/amazon.com_filter-FINAL.JPG" alt="amazon filter" />  <img width="308" height="372" src="http://www.boxesandarrows.com/files/banda/advancing-advanced/kayak_filter-FINAL.JPG" alt="kayak.com search filtering" /></p>
<p><i>Fig. 6: Filtering on amazon.com and Kayak</i></p>
<p><span style="font-size: larger;"><b>Another approach: Progressive disclosure of functionality</b></span><br />
One solution to the essential problems of advanced search discoverability and complexity is to progressively disclose (3) the functionality to the user. Instead of a single, complicated page, break it into understandable units and give each to the user when they ask for it.<br />
In this example, show the ways the user could filter the results (e.g., &ldquo;Brand&rdquo; or &ldquo;Price&rdquo;) in a highlighted banner above the search results.</p>
<p><span style="color: rgb(255, 0, 0);"><span style="font-size: large;"><span style="color: rgb(51, 51, 51); font-family: Helvetica; font-size: 12px;" class="Apple-style-span"><img src="http://www.boxesandarrows.com/files/banda/advancing-advanced/example1-filter-bar-FINAL.gif" alt="show the ways a user can filter" /></span><br />
</span></span></p>
<p><i>Fig. 7: Filters highlighted in a banner (it doesn&rsquo;t have to be green, it just has to stand out!)</i></p>
<p>
When the user clicks a link, display the filter with enough space to clearly articulate how to use it. Don&rsquo;t cram it in; the user asked for it. In contrast to the left-hand column filters in the examples above, which are naturally space-constrained, this method can hold many more types of filters and doesn&rsquo;t show functionality they didn&rsquo;t request.</p>
<p style="margin: 0px; font-family: Helvetica; font-style: normal; font-variant: normal; font-weight: normal; font-size: 12px; line-height: normal; font-size-adjust: none; font-stretch: normal;"><img width="552" height="36" src="http://www.boxesandarrows.com/files/banda/advancing-advanced/filter-closeup-FINAL.gif" alt="a filter" /></p>
<p><span style="font-style: italic;" class="Apple-style-span">Fig. 8: When the user clicks &quot;price&quot; above, give the module enough space to be readable.</span></p>
<p>Let&rsquo;s look at an example scenario.</p>
<p>1.    The user performs a text search normally.</p>
<p style="margin: 0px; font-family: Helvetica; font-style: normal; font-variant: normal; font-weight: normal; font-size: 12px; line-height: normal; font-size-adjust: none; font-stretch: normal;"><img width="547" height="46" src="http://www.boxesandarrows.com/files/banda/advancing-advanced/example1-search-bar-FINAL1.gif" alt="starting a search" /></p>
<p><i>Fig. 9: Searching for a <span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps">DVD</span></span></span></span></span></span></span></span></span></span></span></span></span> player using the simple search box</i></p>
<p>2.    On the search results page, show options to filter the search in a prominent location above the results. Communicate the value of filtering. Order them by popularity, using size and font weight to highlight others. If there are many options, consider hiding rarely used options under an expandable section.</p>
<p><img width="563" height="398" src="http://www-boxesandarrows-com.zippykid.netdna-cdn.com/files/banda/advancing-advanced/example1.gif" alt="example1.gif" /><br />
<i>Fig. 10: A search results page, with options to filter.</i></p>
<p>
3.    When the user clicks an option, display a page module for that search parameter without reloading the page. The user should be able to change the parameters at any time to receive an updated search. If possible, show the number of results for each parameter, so the user can see how exclusive it is, and identify which parameter maybe giving them 0 results.<br />
<img width="563" height="459" src="http://www-boxesandarrows-com.zippykid.netdna-cdn.com/files/banda/advancing-advanced/example2.gif" alt="example2.gif" /><br />
<i> Fig. 11: The search results, filtered by price.</i></p>
<p>
4.    Enable the user to add several modules, stacked in chronological order as the user builds up a complicated query.</p>
<p><img width="563" height="392" src="http://www-boxesandarrows-com.zippykid.netdna-cdn.com/files/banda/advancing-advanced/example3.gif" alt="example3.gif" /></p>
<p><i>Fig. 12: </i><span style="font-style: italic;" class="Apple-style-span">The search results, filtered by price and rating.</span></p>
<p>
<span style="font-size: larger;"><b>Recommendations</b></span><br />
Define an implicit method for Boolean rules  (AND and OR rules) based on normal search patterns &mdash; do not ask users to compose Boolean queries. A system that has worked for me is this: If a user selects several different search parameters, perform an <span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps">AND</span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span> search between them (e.g., Sony <span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps">AND</span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span> Portable). If they choose multiple values for the same parameter, perform an OR search (e.g., Sony OR Panasonic). However, if parameters (such as product features) are clearly non-exclusive, perform an <span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps">AND</span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span> search (e.g., Portable <span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps">AND</span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span></span> &ldquo;HD ready&rdquo;).</p>
<p>Recognize that quick searches, text searches, and advanced searches may be built with different technologies (e.g., direct database searches, a Google box, or a content management system). You may need to work closely with the developers to make a seamless transition between technologies.</p>
<p>If there are many parameters (more than 15), consider reducing complexity by hiding less used ones under a &ldquo;see more&hellip;&rdquo; link below the displayed options. Clicking it should display all the options without a page refresh. Evaluate your search logs to make sure you are exposing the right ones. Consider rotating the exposed ones to discover potential popular features, as exposed options will naturally get more usage.</p>
<p><span style="font-size: larger;"><b>Conclusions</b></span><br />
At its core, advanced search is an under-utilized tool hampered by its own design. By enabling the user to add specificity as they request it, designs such as the one above avoid the lonely fate of the standard advanced search page.  Defusing this complexity and locating it where users will naturally find it will help advanced search be truly advanced.</p>
<p><b>References<br />
</b>(1) Nielsen, J. (1997).  <a href="http://www.useit.com/alertbox/9707b.html">Search and You <i>May</i> Find</a>.  Alertbox, July 15, 1997.<br />
(2) Bray, T. (2003).  <a href="http://www.tbray.org/ongoing/When/200x/2003/06/17/SearchUsers">On Search: The Users</a>.  <br />
(3) Wikipedia. <a href="http://en.wikipedia.org/wiki/Progressive_disclosure">Progressive Disclosure</a> </p>
]]></content:encoded>
			<wfw:commentRss>http://boxesandarrows.com/advancing-advanced-search/feed/</wfw:commentRss>
		<slash:comments>24</slash:comments>
		</item>
		<item>
		<title>Strategies for Improving Enterprise Search</title>
		<link>http://boxesandarrows.com/strategies-for-improving-enterprise-search/</link>
		<comments>http://boxesandarrows.com/strategies-for-improving-enterprise-search/#comments</comments>
		<pubDate>Tue, 11 Sep 2007 19:27:08 +0000</pubDate>
		<dc:creator>John Ferrara</dc:creator>
				<category><![CDATA[Findability]]></category>
		<category><![CDATA[Special topic: Search and Metadata]]></category>

		<guid isPermaLink="false">http://boxesandarrows.com/strategies-for-improving-enterprise-search/</guid>
		<description><![CDATA[Installing a search engine is just the beginning of creating an effective enterprise search system. John Ferrara addresses critical aspects of the user experience often overlooked or ignored.  ]]></description>
				<content:encoded><![CDATA[<p>It&#8217;s common for enterprise website developers to implement search engines with out-of-the-box functionality, point it at their content repositories, and then just leave it at that. Search is becoming something of a neglected orphan, in part because packaged search products are relatively easy to implement, and then even more easily forgotten.</p>
<p>Unfortunately, the results are too often plagued by problems. You know something&#8217;s gone wrong when a perfectly clear query returns results that are not only irrelevant, but seemingly deranged. Pages with a logical relationship to the initial request compete for placement among what Jared Spool fittingly calls &#8220;wacko results.&#8221;<sup><a href="#fn1">1</a></sup>  The majority of participants walking into my usability tests report they don&#8217;t trust embedded site search to help them find what they&#8217;re looking for. </p>
<p>Quality search results only come about through applied effort, requiring in particular the skills of an information architect.<sup><a href="#fn2">2</a></sup>  And IAs must be ready to go well beyond their traditional front-end role, digging into the functional backend and source data of the search engine. This article outlines how we can bolster findability and win back users&#8217; confidence.</p>
<h1>Conceptualizing the Task</h1>
<p>The results of any given search are impossible to predict with precision (short of having tried it before). That&#8217;s because five distinct variables combine to determine its outcome (Figure 1):</p>
<ol>
<li><strong>Search engine.</strong> The algorithmic gears that parse the query and assign pages relevance.</li>
<li><strong>Content.</strong> The documents searched.</li>
<li><strong>Index.</strong>  A catalog of the locations of every word in every document. This is what allows Google to miraculously find 5 billion instances of the word &#8220;the&#8221; in 0.2 seconds.</li>
<li><strong>User input.</strong>  The keywords and other parameters the user submits.</li>
<li><strong>Results display.</strong>  The way the data returned by the search engine is presented.</li>
</ol>
<p><img src="http://www-boxesandarrows-com.zippykid.netdna-cdn.com/files/banda/strategies-for/ferrara_strategies_fig1.gif" width="500" height="334" alt="ferrara_strategies_fig1.gif" /></p>
<p><strong>Figure 1. Five variables that determine the success of a site search. </strong></p>
<p>Critically, the search engine isn&#8217;t the only factor that determines the outcome, so search can&#8217;t be seen purely as a technology problem. It&#8217;s important for organizations to realize that their investment in search doesn&#8217;t end with the product&#8217;s implementation; the most successful approaches will go further to include strategies addressing all of the outside variables.</p>
<h1>Strategies</h1>
<p>Several engine products allow you to tweak the search engine&#8217;s algorithm itself, but I don&#8217;t recommend it. That would be like doing brain surgery to fix a speech impediment&mdash;whether or not you solve that problem, you&#8217;ll inevitably cause a great many more. Changing the algorithm affects all searches, including the ones that already work just fine. So it&#8217;s easiest to keep it stable and modify the factors surrounding it.</p>
<p>Taking the search engine as a constant, then, there are four variables that affect the quality of search. Strategies for improving each of these are proposed below.</p>
<h2>Strategy 1: Make the Content Machine-Readable</h2>
<p>Search engines can provide better results when they&#8217;re given better content. The trick is to provide a basis for inferring the content&#8217;s meaning.</p>
<h3>Structural Markup</h3>
<p>The XHTML structure of pages is relevant to the IA, because content that is more machine-readable will be easier to find using search. Pages should extensively use the correct semantic elements: &lt;h1&gt; through &lt;h6&gt;, &lt;p&gt;aragraph, &lt;q&gt;uotation, &lt;caption&gt;, and so on, as well as semantically named &#8220;class&#8221; attributes.  This will help the search engine compare the usage of terms among pages, to distinguish the central topic of a page from peripheral concepts (Figure 2). While IAs typically don’t mark up individual pages, they can influence the process by specifying template-level semantic elements in their wireframes and participating in periodic content reviews. </p>
<p><img src="http://www-boxesandarrows-com.zippykid.netdna-cdn.com/files/banda/strategies-for/ferrara_strategies_fig2.gif" width="550" height="413" alt="ferrara_strategies_fig2.gif" /></p>
<p><strong>Figure 2. Structural markup explains that Jupiter is the central topic of page A, while in page B it&#8217;s just one of several subpoints on observing planets.</strong></p>
<h3>Standard Meta Tags</h3>
<p>Most websites use keywords and descriptions in meta tags, but not often as part of a larger strategy. The first step is to create a controlled vocabulary, a standardized set of keywords.<sup><a href="#fn3">3</a></sup>  If you tag them as &#8220;teachers&#8221; over here, but &#8220;professors&#8221; over there, the search engine will have a hard time understanding why they&#8217;re the same thing. The keywords should also reflect actual terminology from the page itself (especially headings) and be reinforced in the description tag.</p>
<h3>More Metadata</h3>
<p>Go beyond keywords. Tags that describe the target audience groups, the sector of a financial service, or the cuisine of a recipe page provide more ways to compare and contrast the content; search engines will read as much meta information as you give them. There is a practical limit to how much you can do, which makes user-defined tags well worth considering.</p>
<h3>Ontology</h3>
<p>Humans know that pugs are dogs, and dogs chase cats, and cats play with yarn, but these relationships are lost on computers. An ontology is a list of concepts linked by the ways they relate to one another (Figure 3), helping the search engine grasp the content&#8217;s meaning. If your search product supports ontologies (several do), this can significantly improve the quality of the results.<sup><a href="#fn4">4</a></sup></p>
<p><img src="http://www-boxesandarrows-com.zippykid.netdna-cdn.com/files/banda/strategies-for/ferrara_strategies_fig3.gif" width="500" height="210" alt="ferrara_strategies_fig3.gif" /></p>
<p><strong>Figure 3. An ontology explains the relationships between concepts.</strong></p>
<h2>Strategy 2: Index All of the Right Data</h2>
<p>Indexes have made searching remarkably expedient, but the way they&#8217;re built has a lot to do with the quality and reliability of results. Proper indexing requires taking a hands-on approach, and the IA has an interest in working with the development team to influence it.</p>
<h3>Ignoring Unnecessary Content</h3>
<p>Search engines will automatically index the entire content of a page, regarding everything as equally important. This is a problem because the navigation, for example, will contain terms that are specifically relevant to the siblings, parents, and children of a page, and not to the page itself (Figure 4). There are several methods of excluding this content; the important thing is to make sure that it&#8217;s done, because this is one of the most common reasons why searches return bizarre results.</p>
<p><img src="http://www-boxesandarrows-com.zippykid.netdna-cdn.com/files/banda/strategies-for/ferrara_strategies_fig4.gif" width="500" height="366" alt="ferrara_strategies_fig4.gif" /></p>
<p><strong>Figure 4.  A search for &#8220;Neptune&#8221; may return results that include this page about Jupiter because the term &#8220;Neptune&#8221; appears here in the navigation.</strong></p>
<h3>Getting All Resources</h3>
<p>Users reasonably expect a search to return all of the website&#8217;s relevant publicly available documents.  Unfortunately, many search products can&#8217;t index .pdf, .doc, .xls, .ppt, and similar files, and you can forget about content locked away in audio or video files. The best fix is to convert application files to XHTML and provide transcripts or summaries of multimedia files. This can be a big job, so you may want to initially convert just the most commonly accessed documents.</p>
<h2>Strategy 3: Make the Most of User Input</h2>
<p>It can be difficult to figure out how to phrase a query. Users have to express what are often complicated concepts in that particular set of words that a given search engine will like. It&#8217;s important to make the most of what users submit on their first attempt, because they&#8217;re much less likely to make a second.<sup><a href="#fn5">5</a></sup></p>
<h3>Query Expansion</h3>
<p>All contemporary search vendors offer some type of query expansion, where the search engine automatically looks for words related to the ones the user actually entered (Figure 5). Word stemming, which searches for different forms of the same word, is usually enabled by default.  However, the thesaurus, which searches for equivalent and related terms, requires manual work.<sup><a href="#fn6">6</a></sup>
</p>
<p><img src="http://www-boxesandarrows-com.zippykid.netdna-cdn.com/files/banda/strategies-for/ferrara_strategies_fig5.gif" width="500" height="284" alt="ferrara_strategies_fig5.gif" /></p>
<p><strong>Figure 5.  Searches shouldn&#8217;t only look for the terms as the user entered them, but for related and alternate forms of those terms.</strong></p>
<p>You can go overboard defining synonyms, but the problem is usually too little (by which I mean &#8220;none at all&#8221;) rather than too much.<sup><a href="#fn8">8</a></sup> Search logs are the best resource for discovering synonyms, related terms, and common misspellings. Set up ongoing reviews to add terms that users actually submit to the thesaurus, drawn from the wealth of data that&#8217;s freely available in the logs. The number of successful first attempts will rise dramatically over time.</p>
<h3>Syntax Conventions</h3>
<p>Users should be able to submit searches in whatever way they learned to write them. Unfortunately, search engines have different syntaxes for the standard operators (And, Or, Not, exact string). You can&#8217;t rely on a help file&mdash;it&#8217;s one of people&#8217;s least favorite things to read. The parser should instead be scripted to accept all common syntax conventions, so the user doesn&#8217;t have to guess. It should also use &#8220;And&#8221; as the default operator, which will appropriately limit the results downward as more terms are added to the search.</p>
<h3>Assisting Query Formulation</h3>
<p>Suggestion functions provide users with a list of similar queries that other people have tried as they type. This makes a lot of sense, since it can be difficult to put a complex idea into words or to recall the precise name of an item. Stellar examples of suggest functions include <a href="http://labs.google.com/suggest">Google Suggest</a>, <a href="http://livesearch.alltheweb.com/">AllTheWeb</a>, and <a href="http://www.apple.com">Apple&#8217;s website</a>.</p>
<h2>Strategy 4: Build the Results Page Around the User&#8217;s Needs</h2>
<p>The results page should be designed to help users find matches for their interests as quickly as possible. This is closer to the IA&#8217;s typical interface design role, yet it&#8217;s still uncommon to see much more than the vendor&#8217;s out-of-the-box functionality on search results pages.</p>
<h3>Showing Relevance</h3>
<p>Sometimes a search engine will return the right results, but the user will fail to recognize it. Users need to see why results are relevant to their searches. There are two simple ways to do this.</p>
<p>The first is to show a text excerpt from the page that contains the terms from the user&#8217;s query, instead of the &lt;meta&gt;description field. The description may vary greatly from the user&#8217;s entered query&mdash;especially on long pages&mdash;and it may not be at all clear why a particular page was retrieved. Instead, an excerpt of the actual content that matches the search will directly explain why a user might want to click through to that page.</p>
<p>The second way to show relevance is to bold the terms in the excerpt that match terms in the user&#8217;s original query. That will help the user to quickly scan the page for the results that have the right words in the right context (Figure 6).</p>
<p><img src="http://www-boxesandarrows-com.zippykid.netdna-cdn.com/files/banda/strategies-for/ferrara_strategies_fig6.gif" width="500" height="375" alt="ferrara_strategies_fig6.gif" /></p>
<p><strong>Figure 6.  Excerpting and term highlighting allow the user to understand how each result relates to the query, and quickly identify the ones that are most relevant.</strong></p>
<h3>Best Bets</h3>
<p>Despite all optimization efforts, search engines sometimes still miss strong associations that are obvious to people. In cases where particular keywords should be returning specific pages, it can be helpful to include a list of manually specified &#8220;Best Bets,&#8221; triggered by business rules (Figure 7).<sup><a href="#fn8">8</a></sup>  This reintroduces the designer&#8217;s influence into search, smoothing out irregularities in the reliability of automated results.</p>
<p><img src="http://www-boxesandarrows-com.zippykid.netdna-cdn.com/files/banda/strategies-for/ferrara_strategies_fig7.gif" width="500" height="292" alt="ferrara_strategies_fig7.gif" /></p>
<p><strong>Figure 7.  Best bets allow the designer to force particular pages to be returned when the user&#8217;s query contains a specific string.</strong></p>
<h3>Conditional Content</h3>
<p>Taking Best Bets one step further, consider including contextually appropriate content in the search results page when a string in the user&#8217;s query indicates the user probably has a particular interest in mind.  For example, a user searching for &#8220;extrasolar planets&#8221; on an astronomy website might appreciate a results page that includes a list comparing the properties of all planets discovered beyond our solar system.</p>
<h1>Conclusion</h1>
<p>This article introduces just some of the steps that you can take to improve the overall search experience on your site. The reliability of enterprise search needs significant improvement to reestablish user confidence, and IAs should take the lead. To get there, a product&#8217;s out-of-the-box functionality must not be seen as the end, but as just the beginning. </p>
<p>
<strong>REFERENCES</strong></p>
<ul>
<li>
<p id="fn1"><sup>1</sup> Jared Spool: <a href="http://www.uie.com/brainsparks/2006/04/14/bbc-reports-users-lose-patience-with-poor-search-2/">&#8220;BBC Reports Users Lose Patience with Poor Search&#8221; </a> </li>
<li>
<p id="fn2"><sup>2</sup> Lou Rosenfeld &#038; Peter Morville, <em>Information Architecture for the World Wide Web</em>, pp 136-137. </li>
<li>
<p id="fn3"><sup>3</sup> Fred Leise, Karl Fast, and Mike Steckel: <a href="http://www.boxesandarrows.com/view/creating_a_controlled_vocabulary">&#8220;Creating a Controlled Vocabulary&#8221;</a> </li>
<li>
<p id="fn4"><sup>4</sup> Tim Berners-Lee: <a href="http://www.sciam.com/article.cfm?articleID=00048144-10D2-1C70-84A9809EC588EF21&#038;catID=2">&#8220;The Semantic Web&#8221;</a> </li>
<li>
<p id="fn5"><sup>5</sup> Jared Spool: <a href="http://www.uie.com/articles/users_search_once/">&#8220;People Search Once, Maybe Twice&#8221;</a> </li>
<li>
<p id="fn6"><sup>6</sup> Christina Wodtke, <em>Information Architecture: Blueprints for the Web</em>, pp. 137-140.</li>
<li>
<p id="fn7"><sup>7</sup>Lou Rosenfeld &#038; Peter Morville, <em>Information Architecture for the World Wide Web</em>, pp. 188-189.</li>
<li>
<p id="fn8"><sup>8</sup> Chris Farnum: <a href="http://www.slideshare.net/ChrisFarnum/tuning-up-site-search-ia-summit-2007">&#8220;Tuning up Site Search&#8221;</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://boxesandarrows.com/strategies-for-improving-enterprise-search/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
		</item>
		<item>
		<title>Doing Today&#8217;s Job with Yesterday&#8217;s Tools</title>
		<link>http://boxesandarrows.com/doing-todays-job-with-yesterdays-tools/</link>
		<comments>http://boxesandarrows.com/doing-todays-job-with-yesterdays-tools/#comments</comments>
		<pubDate>Mon, 26 Feb 2007 05:44:39 +0000</pubDate>
		<dc:creator>Patrick Dubroy</dc:creator>
				<category><![CDATA[Findability]]></category>
		<category><![CDATA[Interfaces]]></category>
		<category><![CDATA[Professionalism]]></category>

		<guid isPermaLink="false">http://boxesandarrows.com/doing-todays-job-with-yesterdays-tools/</guid>
		<description><![CDATA[Where is the software that can help us cope with the massive amounts of information that we deal with on a daily basis? Patrick Dubroy points out the problems with current personal information management , and makes suggestions about how to
improve the situation.]]></description>
				<content:encoded><![CDATA[<h2>The Problem</h2>
<p>Okay, I&rsquo;ll admit it. I&rsquo;m hopelessly disorganized in my digital life. My inbox is overflowing with email. My documents are scattered across a half dozen hard drives, none of them backed up. When I recently needed an up-to-date resume, I had to write it from scratch, because I couldn&rsquo;t find a copy anywhere.</p>
<p>Most people would say that it&rsquo;s my own fault. It&rsquo;s true; I should take greater care in organizing my data. But honestly, I&rsquo;m just too lazy to spend the time to sort all my files into the proper folders. And I&rsquo;d like to think that I have more important things to worry about than when I ran my last backup.</p>
<p>There&rsquo;s an old adage in software development that says laziness is a virtue. By laziness, we mean only avoiding <em>unpleasant</em> work. For a programmer, the most tedious work to do is work that could be done by a program. Rather than spend an hour on a repetitive task, a programmer will spend 59 minutes writing a program to complete the task in 30 seconds. As Abraham Lincoln said, &ldquo;give me six hours to chop down a tree and I will spend the first four sharpening the axe.&rdquo; In the same spirit, I justify my laziness because I think software should do most of the work of information management for me.</p>
<table width="280" border="0" align="right">
<tbody>
<tr>
<td align="center"><a href="http://www-boxesandarrows-com.zippykid.netdna-cdn.com/files/banda/doing-todays-job/itunes_metadata_th.gif"><img width="250" height="199" alt="itunes metadata " src="http://www-boxesandarrows-com.zippykid.netdna-cdn.com/files/banda/doing-todays-job/itunes_metadata_th.gif" /></a><br />
            iTunes uses formal metadata fields</td>
</tr>
<tr>
<td align="center"><a href="http://www-boxesandarrows-com.zippykid.netdna-cdn.com/files/banda/doing-todays-job/flickrtags.gif"><img width="250" height="157" alt="flickr tags" src="http://www-boxesandarrows-com.zippykid.netdna-cdn.com/files/banda/doing-todays-job/flickrtags_th.jpg" /></a>Flickr prefers freeform &ldquo;tagging&rdquo;</td>
</tr>
</tbody>
</table>
<p>There are plenty of great information management tools out there. Certainly, iPhoto has made it easier to organize my digital photos. Flickr and Del.icio.us have popularized tagging&mdash;organizing items by simply marking them with keywords&mdash;and created a new way to navigate large amounts of data. And iTunes is a definite improvement over manually organizing MP3s into folders.</p>
<p>But as helpful as these applications are, they can be frustrating to use, because each one implements a slightly different set of features, even though they are basically solving the same information management problems. For example, iPhoto allows you to tag a photo with keywords, but iTunes doesn&rsquo;t allow you to do the same thing for a song. Subtle incompatibilities like this can contribute to a frustrating user experience, because the interface doesn&rsquo;t behave like you expect it to.</p>
<p>Even worse than slight incompatibilities between applications, is that they often support entirely different data models. In <cite>The Design of Everyday Things</cite>, Donald Norman explains that when we use a tool&mdash;like a drill, a car, or a computer application&mdash;we have a <em>mental model</em> in mind of how the tool works, and how it will react to our actions. This mental model guides how we use the tool. With so many different applications to manage our data, we have to keep track of several different data models, and it&rsquo;s easy to get confused. For instance, when I&rsquo;m browsing my photos, I might see a photo that I want to send to a friend. In both Picasa and iPhoto, I can click a button that allows me to email the photo to them. But I can&rsquo;t do the same thing with a song in iTunes, or a bookmark in Firefox. What&rsquo;s so different about each of those things? In my mental model, they are all just objects that I want to send to my friend. Unfortunately, this data lives in a balkanized world, and what we are allowed to do with the data depends on what form it is in.</p>
<p>This balkanization of our data also makes it more difficult to find things. Before being able to search for something, you have to know what form the data is in, so that you can search in the right application. Did I store it as a Del.icio.us bookmark? Did someone email it to me, or was it in an instant message? Applications like Google Desktop and Apple&rsquo;s Spotlight help address this problem, but they support a limited number of data formats, and they aren&rsquo;t able to search across multiple machines.</p>
<p>Another usability problem occurs when trying to share data between applications. A really simple example: my friend Ryan asks me to email him the photos from our last trip to Mt. Washington. I have no problem finding the photos in Picasa, because I&rsquo;ve got an album called &ldquo;Mt. Washington Trip 2006&rdquo;. I can open the album and browse through thumbnails of the photos, looking for that great one that I took from the summit. But when I try to email it to Ryan from my Yahoo! Mail account, I have to browse through the file system to find the file. Even though I have the photo up in Picasa&mdash;Right there! That one!!&mdash;I can&rsquo;t communicate that in an intuitive way to the web browser. Luckily, I know how to map from a photo in Picasa to the corresponding file on disk, but many people would not. Picasa provides a great abstraction: instead of thousands of files with unintelligable names like <span class="caps"><span class="caps"><span class="caps"><span class="caps"><span class="caps">IMG</span></span></span></span></span>_1792.jpg, it lets us work with the pictures, captions, and albums. But Picasa&rsquo;s abstraction is like a dialect private to an isolated town: as soon as we leave, we are forced to use the computer equivalent of grunts and hand signals.</p>
<p>All of these problems are caused by the fact that by using many different specialized applications for personal information management, the data is segregated based its form. Using the term segregated isn&rsquo;t an exaggeration&mdash;in some ways, the data is literally not allowed to mix together. For example, I&rsquo;d like to gather a digital scrapbook of my trip to Europe. It would have emails that I sent and received during the trip, contact information for the people that I met, bookmarks for various places that I stayed, and of course, lots of digital photos. On most systems, this is difficult, if not impossible. It can be done in a crude way by copying some files into a folder and cutting and pasting into text files. But then I would lose all the features of the specialized applications, like captions on the photos.</p>
<p>In short, I believe that there are several usability problems caused by the fact that we use many different specialized applications for managing our data. We can become frustrated and confused by incompatible data models and inconsistent features. It&rsquo;s harder to find the information we are looking for, because we have to remember what form the data is in. Communication between applications is awkward because they don&rsquo;t speak the same language. The data is stuck in silos, segregated by its type. This prevents us from using perfectly natural ways of organizing our data.</p>
<h2>Towards a Solution</h2>
<p>Now that we&rsquo;ve established what the problem is, the question is: what can we do to fix it? Obviously we can&rsquo;t expect to have a single application which will support all of our needs. We still need specialized software like iPhoto for managing photos, and GMail for email. I think that the problem is not really with the applications themselves, but with the platform they&rsquo;re built upon.</p>
<p>In software terms, a platform is a collection of common routines, and a set of interfaces allowing applications to use the routines. Normally, an application is built directly on the routines provided by the operating system. Developers and designers have long understood that an inconsistent user interface is difficult to use, so the UI is built into the platform, resulting in applications that mostly look and feel the same. In order to achieve the same kind of consistency with information management features, we need a platform designed for the manipulation of rich information.</p>
<p><pullquote>While the amount of information that the average person deals with has increased dramatically in the last 20 years, file systems have hardly changed at all.</pullquote>
All modern operating systems do in fact provide a common way to manage information: the file system. Unfortunately, while the amount of information that the average person deals with has increased dramatically in the last 20 years, file systems have hardly changed at all. We are still stuck with the old file and folder model. The problem with this model is that an increasing amount of data just doesn&rsquo;t fit into it. For example, a single email usually does not correspond directly to a file on the local disk. Another example is bookmarks&mdash;many people collect and organize hundreds of bookmarks, but a bookmark is not a first-class object like a file.</p>
<p>In a broad sense, this new information management platform that I am proposing is really just a new kind of file system, based on the needs of today&rsquo;s users. We need a system that will make it easier to manage and navigate the large amounts of rich and diverse information that people deal with every day.</p>
<p>In the first part of this article, I identified five distinct usability problems, all caused by the fact that we use many different specialized applications for managing our data:</p>
<p>1. Inconsistent features between applications<br />
2. Incompatible data models<br />
3. Difficult to find data, because we have to know where to look based on the type of the data<br />
4. Awkward to share data between applications<br />
5. Inability to mix different types of data together</p>
<p>In the same way the user interfaces are much more consistent because applications all use the same toolkits, then having a common information management framework that other applications can build upon will go a long way towards a more consistent set of interactions. I&rsquo;d like to outline what I think are the key requirements for such a framework to be successful.</p>
<p><strong>Requirement 0: Be a useful and usable framework</strong></p>
<p>Only if it&rsquo;s actually <em>used</em> can an information management framework help solve the problems I&rsquo;ve identified here. The framework must be easy for application developers to build upon, and it must be useful enough to be worth their effort. By building on this framework, application developers would be able to focus on the core functionality of their applications, rather than wasting their time reinventing common information management features.</p>
<p><strong>Requirement 1: Extensible for new kinds of data</strong></p>
<p>By having applications build upon this framework, we eliminate the problem of having incompatible data models. But the platform must be extensible to be able to handle new types of data. The reason that we have to deal with the different data models of specialized applications is because the existing platform (the file system) was not suited for managing the rich data that today&rsquo;s applications require. If the framework I&rsquo;m proposing is not built from the ground up to be extensible, we will quickly find ourselves in the same situation we are now: trying to do today&rsquo;s job with yesterday&rsquo;s tools.</p>
<p><strong>Requirement 2: Comprehensive search capability</strong></p>
<p>The third problem I identified was that it&rsquo;s difficult to find data, because you have to know where to look depending on what form the data is in. If it&rsquo;s in an email, you have to search in one place, but if it&rsquo;s in a file on your hard drive, you have to search in another place.</p>
<p>While search is not the answer to all our information management problems, it is a very useful feature. Now that Google is a verb, most people are comfortable using search as a primary way to find data. A new platform for information management should provide advanced search capability. Apple has done the right thing by building Spotlight&rsquo;s sophisticated search functionality into the operating system, and allowing applications to build upon it.</p>
<p>But in order for search to be truly effective, we need to be able to search all of our data at once, instead of having to search in each of the individual silos. Having a single framework for managing rich information means that it will be able to search through all different kinds of data, no matter what form it takes.</p>
<p><strong>Requirement 3: All data on equal footing</strong></p>
<p>One of the problems I identified with current information management systems is that it&rsquo;s difficult, if not impossible, for different types of data to be mixed together. You can&rsquo;t create a folder that contains an email, a photo album, and some bookmarks. This problem is also related to the problem of inconsistent features and data models. Things that can be done with one type of data, like a file on the file system, can&rsquo;t necessarily be done to other kinds of data.</p>
<p>In other words, there is an artificial distinction between different types of data. What a bookmark, an email, and a text file all have in common is that they are distinct, discrete pieces of information. If the purpose of the file system is to allow the user to store and organize information, then it should be able to treat these kinds of items equally. All types of data must be on equal footing. Anything that can be done with a file&mdash;like copying, searching, or sorting&mdash;should be possible with other pieces of information. If all data is on equal footing, then it would be possible to have a folder containing several different types of data.</p>
<p><strong>Requirement 4: Flexible organization features</strong></p>
<p>The folder (or directory) is the most common organizational metaphor used on computers. Originally, this concept was designed to be analogous to a physical file folder, so a document could only ever be in one folder. But it often makes sense for a document to be in two different folders at the same time. For example, if you had tickets to take a client out to a hockey game, should you put them in the &ldquo;hockey&rdquo; folder, or the &ldquo;work&rdquo; folder?</p>
<p>In information architecture, it&rsquo;s good practice to support several paths to a piece of information. This is generally because we need to support many different users, who might have a different mental models. But even with a single user, there are sometimes several different mental models involved. Just today I went looking for my wallet, and couldn&rsquo;t find it anywhere&mdash;although I&rsquo;m sure I put it somewhere that made sense at the time.</p>
<p>The idea that an object could exist in multiple folders is known as multiple classification, and it has recently become popular in the form of tags. Flickr, Del.icio.us, and many other web services allow you to associate several keywords with your data. By doing so, you are indicating that the data falls into various categories, with the idea that this will help you or someone else more easily find the data later.</p>
<p>Providing support for multiple classification is just one example, but in general, for a new information management platform to be successful, it must be flexible enough to allow you to organize your data however you want.</p>
<h2>Conclusion</h2>
<p>In the first half of this article, I identified several usability problems with the current state of information management software. We use many different specialized applications for dealing with different kinds of information, and the applications have inconsistent features and incompatible data models. It is harder to find our data, because we need to know what form it is in, so that we can look in the right place. It&rsquo;s awkward, and sometimes impossible, to share data between applications, and to mix the data together outside of the specialized applications.</p>
<p>To solve these problems, I proposed a platform that could be used to build the next generation of information management applications. Having a common platform for developers to build upon would give us greater consistency between applications&mdash;they would have the features we expect, and these features would work in the same way. Integration between applications would be much easier, as they would have a lingua franca for exchanging rich information. Different kinds of data could be mixed together, allowing users to easily organize their data in a way that makes sense to them.</p>
<p>I proposed five requirements for such an information management framework:</p>
<ol>
<li><b>Be a useful and usable framework.</b> This should go without saying, but it&rsquo;s important to keep in mind that this framework can only help solve our information management problems if it is useful, and it is attractive for developers to build upon</li>
<li><b>Extensible for new kinds of data.</b> If the system is not built to be extensible, we will soon find ourselves right where we are now: doing today&rsquo;s job with yesterday&rsquo;s tools.</li>
<li><b>Comprehensive search capability.</b> This one should speak for itself. With the overwhelming amounts of information that we have to deal with, advanced search capability is an indispensable feature.</li>
<li><b>All data on equal footing.</b> Several of the problems I identified stem from the fact that in current systems, certain types of data are not first-class.</li>
<li><b>Flexible organizational features.</b> You should be able to organize your data in whatever way works best for you.</li>
</ol>
<p>I believe that these requirements provide a good starting point for an information framework that application developers could build upon, and ultimately give us an easier, more usable set of information management tools.</p>
<p>And then I would have no more excuses for being disorganized.</p>
]]></content:encoded>
			<wfw:commentRss>http://boxesandarrows.com/doing-todays-job-with-yesterdays-tools/feed/</wfw:commentRss>
		<slash:comments>20</slash:comments>
		</item>
		<item>
		<title>Enterprise Information Architecture: A Semantic and Organizational Foundation</title>
		<link>http://boxesandarrows.com/enterprise-information-architecture-a-semantic-and-organizational-foundation/</link>
		<comments>http://boxesandarrows.com/enterprise-information-architecture-a-semantic-and-organizational-foundation/#comments</comments>
		<pubDate>Wed, 29 Nov 2006 03:56:53 +0000</pubDate>
		<dc:creator>Tom Reamy</dc:creator>
				<category><![CDATA[Findability]]></category>
		<category><![CDATA[Professionalism]]></category>
		<category><![CDATA[Special topic: Search and Metadata]]></category>
		<category><![CDATA[Workplace and Career]]></category>

		<guid isPermaLink="false">http://boxesandarrows.com/enterprise-information-architecture-a-semantic-and-organizational-foundation/</guid>
		<description><![CDATA[People disagree on what happens when IAs grow up, but Tom Reamy knows. He offers a foundation for information architecture as it advances, grappling with problems across the enterprise.]]></description>
				<content:encoded><![CDATA[<pullquote>&#8220;Whatever it is that information architects learn to become enterprise information architects, I think it is essential that we not lose our focus. The heart of IA is information and knowledge, and we need to build on that foundation, not try to turn into something else.&#8221;</pullquote>
<p>In “<a href=""http://www.boxesandarrows.com/view/succeeding_at_i">Succeeding at IA in the Enterprise</a>,” James Robertson calls for Enterprise Information Architects to pay attention to the realm of business strategy, and Bob Goodman in “<a href="http://www.boxesandarrows.com/view/change_architecture_bringing_ia_to_the_business_domain">Change Architecture: Bringing IA to the Business Domain</a>” offers the notion of EIAs as change agents or change architects.</p>
<p>Yes, EIAs can be a force for change, but so can anyone. The real question is: should the discipline of Enterprise Information Architecture be defined to include organizational change as one of its essential features. I don’t think it should be.</p>
<p>Whatever it is that information architects learn to become enterprise information architects (EIA), I think it is essential that we not lose our focus.  The heart of IA is information and knowledge, and we need to build on that foundation, not try to turn into something else when we add the term “enterprise.”</p>
<p>It is the same for business strategy.  Yes, EIAs should better understand their organization’s business strategy and integrate the strategic vision.  And, yes, EIAs can contribute to the development of business strategy, but not directly. We should not try to tell business analysts how to do their job. What we can do is provide input into the development of business strategy based on our understanding of and access to information and how those issues are essential to any business strategy.</p>
<p>Integrating business strategy into EIA is only a part of what it means to move from IA to EIA, and I’d like to take a look at the essential issues that IAs bring to the enterprise and what sorts of things IAs need to add and/or learn to develop an enterprise perspective.</p>
<p><span class="subhead">The move towards “Enterprise”</span><br />
To start we should recognize that adding “enterprise” to our titles is part of a broader trend toward enterprise solutions in a variety of fields from technology to management restructuring.  For example, content management, knowledge management, and learning management software vendors are all moving toward creating platforms for the entire enterprise that cover the entire life cycle of information and knowledge.</p>
<p>However, the truly essential feature of enterprise solutions &#8212; and what we have to recognize if we want to become EIAs &#8212; is that moving to an enterprise perspective does not mean simply doing bigger or more varied projects. Rather, it is a move away from a project-centric model toward an infrastructure model.</p>
<p>In order to have an enterprise-wide solution, you must develop an infrastructure that enables projects to be fully integrated, that enables projects to build on a common foundation rather than always starting from scratch, and that enables projects to be accomplished cheaper and faster with a broader impact.</p>
<p>The next step is the most important and the most difficult.  It is the creation of a solid, well-articulated semantic infrastructure. </p>
<p><span class="subhead">Semantic infrastructure</span><br />
A semantic infrastructure consists of all the various kinds of information and content in an organization plus the ways that information and content is organized and structured.  It includes structured and unstructured documents, tacit knowledge, and other non-document-based content like images and animation.  It also includes the policies and procedures around the creation and dissemination of that content, the technologies that support content creation and dissemination (search, CM, portals, etc.), and finally, and most importantly, the ways that content is structured such as metadata, taxonomies, controlled vocabularies, semantic networks, ontologies, social network analysis representations, knowledge and topic maps, and other advanced knowledge representations.</p>
<p>A semantic infrastructure also includes modeling and/or mapping people and their activities.  This includes formal and informal communities of all kinds, the information behaviors associated with those communities, linguistic and category preferences, and most of all, how information is used within the full range of business and support activities that these communities engage in.</p>
<p>In my opinion, it is this enterprise-wide semantic infrastructure that is the context within which EIA should operate, and it is the essential feature of defining what an enterprise information architect does and understanding how EIAs differ from IAs.</p>
<p><span class="subhead">Distinguishing EIA from IA</span><br />
Two key elements distinguish an enterprise IA from a basic IA. The first is the role an EIA plays in the design, development, and maintenance of an enterprise’s semantic infrastructure.  The second is the scope and type of projects an EIA can be involved in as they develop applications that use and build on this semantic infrastructure. </p>
<p>Let’s start by looking at how an EIA might help develop a semantic infrastructure.  A complete answer is beyond the scope of this article, but a simple answer is: IA skills and tools can support research that is fundamental to the creation of a well-designed semantic infrastructure.  This can include such activities as researching information needs and behaviors of different users, mapping different communities in an organization, and ethnographic studies.</p>
<p>Just as more and more enterprises become convinced of the importance of content structures like taxonomies, however, EIAs should become more aware of and focus more attention on these content structures. EIAs should also be more aware of how different groups categorize the world around them in different ways and what those differences mean for information architecture.  For example, as someone’s knowledge of a field increases, they tend to focus on the more precise levels of a taxonomy while novices tend to prefer higher level categories. Mapping these category level preferences into designs is an important component of a semantic architecture. This is a natural extension of the traditional work on metadata and labeling that goes into IA work today. </p>
<p>The type and scope of applications and projects is the second key element that distinguishes an IA from and EIA.  For traditional information projects like search and portals, and even individual intranet website projects, it is important to approach them with an enterprise-wide perspective.  One of the reasons these types of projects fail to deliver their full value is that they do not take into account the entire enterprise.</p>
<p>In addition, enterprise information architects work on a broader range of projects, not just traditional information projects like search and Content Management.  These other projects could include adding an important emphasis on how information organization and presentation impacts all business activities.  (But not to tell other groups how to run their business activities.)</p>
<p>EIAs should also be involved in knowledge management projects like collaboration, communities of practice, and innovation programs. Too often the failure of knowledge management programs is blamed on culture, when all it needed was an understanding of and support for the various group’s information behaviors – a really good information architecture.</p>
<p>Finally, EIAs should not just be involved in the enterprise’s information architecture, but also involved in the information architecture <i>of</i> the enterprise. They should apply IA skills to understand, model, and support how information and knowledge flows within the enterprise.  </p>
<p>This could include studying and developing new information architectures for how people do everyday jobs, how desktops or webtops are set up, how those designs impede daily information use, and how to improve those designs.  It could also use an expanded social network analysis of who gets their information from what people or content repositories and what kinds of information are not reaching critical audiences.  In other words, instead of focusing on the design of the intranet, EIA can focus on the people in the organization and support the information seeking strategies and behaviors that are part of their daily routine.  </p>
<p><span class="subhead">The place of EIA in the enterprise</span><br />
The last question I want to examine is where EIA fits in within today’s organizations. It is clear that a semantic architecture needs a broad, interdisciplinary team.  It should include members from IT, business units, and information professionals like librarians, information architects, and others.</p>
<p>In addition, this interdisciplinary team will need to partner with other departments throughout the enterprise, including business, technology, sales, administration, and research. </p>
<p>The actual makeup of the team and where it is located organizationally will vary from industry to industry, but from experience we know it should not be located in IT.  IT should be involved; people from IT should be on the team, but despite the fact that “information” appears in their department description, we should remember that they are not really information professionals.  </p>
<p>For example, many enterprise search, content management, and portal projects fail to deliver full benefits because what is needed to make them work goes way beyond traditional “needs assessments.”  It is not enough to have people go out and ask users what they need.  You need to have information professionals (IAs, librarians, knowledge engineers, etc.) to develop new creative possibilities to offer to users.  You need someone who can ask deeper questions about information behaviors.  </p>
<p>Having IT, business analysts, and subject matter experts involved is important and necessary, but none of those three groups understands information and knowledge at a sufficiently deep level to offer truly creative and innovative alternatives that make information and knowledge systems work across the whole enterprise.</p>
<p>On the other hand, it has been suggested that EIA should lead this central information/knowledge team.  Personally, I don’t think this is a good idea for a number of reasons having to do mostly with the difference between knowledge and information and the background/training that most IAs have. However, EIA might be one candidate to lead this interdisciplinary team, as long as we start with a greatly expanded definition of the role of EIA based on a stronger emphasis on semantic architecture.  </p>
<p>The other major candidates for leading this team are knowledge management and learning departments.  Learning groups are a good place, but like IA, traditionally, they don’t deal with the depth and breath of knowledge modeling that is needed.  Learning is only one type of information and knowledge activity and we need something to cover the full spectrum of activities: searching, using existing information to create new documents, writing reports, and even using information to make a sale.</p>
<p>Personally, I think that knowledge management departments are a very good place to locate this team, but it does require KM to shift focus away from an over-reliance on the idea of tacit knowledge and move towards actually modeling knowledge&#8212;in other words, KM with more emphasis on knowledge than management. </p>
<p>Knowledge architecture is a term that I think captures the essential nature of this interdisciplinary team and, at the same time, adds the focus of actually modeling knowledge to KM. However, it is not a widely used term and requires significant explanation. </p>
<p>Whatever term is used will require some redefining (KM), some expansion (EIA), or some explanation and general acceptance (KA). Take your pick or come up with an entirely new one.  </p>
<p>However, what is more important than who will lead this team is to pay attention to how all the members of this interdisciplinary team communicate among themselves. The team will need the services of a good IA to help the team communicate and interact with the rest of the organization. Regardless of who leads, the Enterprise Information Architect will always be an essential part.</p>
<p><span class="subhead">Conclusion</span><br />
There is a great deal more that could be said be but in the interests of brevity, let me close with this thought.  This vision of enterprise information architecture might not be what you think of or want an EIA to be, but this vision is meant to be a broad one that attempts to locate EIA within an even broader context that ultimately consists of some form of knowledge management.  Within this vision of EIA, however, there is room for many different IA roles.  </p>
<p>Some EIAs might work closely with the team doing more advanced knowledge representations. Some might work with the semantic specialists like taxonomist and metadata designers. Some might work with business strategy groups, and some might work with business process re-designers.  On the other hand, some might work on traditional intranet projects, some with search or portal projects, and some doing traditional IA on new knowledge management projects.</p>
<p>For me, it is not so much that EIA represents a new field, so much as expanding the definition of what information architecture is and, at the same time, locating IA within a new enterprise approach to information and knowledge.</p>
]]></content:encoded>
			<wfw:commentRss>http://boxesandarrows.com/enterprise-information-architecture-a-semantic-and-organizational-foundation/feed/</wfw:commentRss>
		<slash:comments>14</slash:comments>
		</item>
		<item>
		<title>Long Tails and Short Queries</title>
		<link>http://boxesandarrows.com/long-tails-and-short-queries/</link>
		<comments>http://boxesandarrows.com/long-tails-and-short-queries/#comments</comments>
		<pubDate>Tue, 17 Oct 2006 03:30:35 +0000</pubDate>
		<dc:creator>Christina Wodtke</dc:creator>
				<category><![CDATA[Findability]]></category>
		<category><![CDATA[Interfaces]]></category>
		<category><![CDATA[Interviews]]></category>
		<category><![CDATA[Learning From Others]]></category>
		<category><![CDATA[Special topic: Search and Metadata]]></category>

		<guid isPermaLink="false">http://boxesandarrows.com/long-tails-and-short-queries/</guid>
		<description><![CDATA[Why haven't we figured out search yet? Amanda Spink talks with Christina Wodtke on why searchers still can't ask a useful question of a search engine, and how Google may be part of the problem rather than part of the solution.]]></description>
				<content:encoded><![CDATA[<pullquote>&#8220;People don&#8217;t understand their own information behaviors, and they don&#8217;t really understand much about search or the web, so they will have to learn. It could take generations.</pullquote>
<p>Amanda Spink is one of the smartest people working on user behavior while using web search, yet when I mentioned her name to a friend who&#8217;s spent the last year working on the search user experience, he had never heard of her. The design community is woefully undereducated about search, and is often prone to redesigning Google and postulating what Yahoo! is doing wrong, rather than working to understand why search engines have chosen to do what they do. I suppose this shouldn&#8217;t be surprising, though, considering Spink&#8217;s work is more often seen in scholarly journals, such as <i>New Directions in Human Information Behavior</i> and <i>Journal of the American Society for Information Science and Technology</i> (brought to you by the same folks who bring you the IA Summit, yet rarely cracked by the working folks).</p>
<p>In order to help correct this problem, I shyly contacted my hero by email, and overcame the time difference between sunny California and even sunnier Brisbane, Australia, with a series of email questions. </p>
<p><b>Christina Wodtke</b>: When I joined Yahoo!, I had never worked on search before. Your article, &#8220;<a href=http://portal.acm.org/citation.cfm?id=619073.621942&#038;dl=&#038;dl=acm&#038;CFID=15151515&#038;CFTOKEN=6184618>From E-Sex to E-Commerce: How Search Changes</a>&#8221; was one of the most valuable in beginning to get my mind around the problem of search. Since reading that article, however, I haven&#8217;t seen much change from your findings. In your opinion, are users changing their search behavior, or are they still following the same patterns you found when you analyzed Excite&#8217;s data?</p>
<p><b>Amanda Spink</b>: You make a good observation. Since 1997, our findings have come from analyzing large-scale web user data gathered from commercial web companies, including Excite, Ask Jeeves, Alltheweb.com, Alta Vista, Vivisimo, and Dogpile. Our research since 1997 shows some trends and changing patterns in general searching. However, looking at more recent data from Vivisimo and Dogpile, most web queries are still short&#8212;2 to 3 terms, and sessions include little query modification and are generally 2-3 queries in length. </p>
<p>Few people use advanced search features, and many queries include spelling and other mistakes that adversely affect the search results. People look at only a few result pages&#8212;not beyond the first or second results pages.</p>
<p>A small number of terms are used with high frequency, and many terms are used once. Web queries are very diverse in topic and some, such as people&#8217;s names, are unique. </p>
<p><b>CW</b>: You&#8217;re referring to the &#8220;long tail&#8221; of a Zipf curve? How does this affect search engine&#8217;s strategies in providing relevant results?</p>
<p><b>AS</b>: The long tail makes &#8220;relevant&#8221; retrieval very difficult, especially if users are only providing the search engine with two to three words on which to base relevance judgments. Despite the inherent &#8220;interactive&#8221; nature of search, much of search is not very interactive. The talk about personalization is an attempt to obtain more information from the user to help determine relevance. </p>
<p><b>CW</b>: So are users changing their overall behavior at all? </p>
<p><b>AS</b>: We are seeing a growth in more complex search behaviors. More people are searching for information using more than one search. This might mean repeat searches of the same query over time or modifying the queries in successive searches over time. Many people are multitasking or searching for information on more than one topic during a search session. People&#8217;s information needs are often quite complex in their home and work environments. </p>
<p><b>CW</b>: How are they handling that? </p>
<p><b>AS</b>: During multitasking search they may include two or more topics in one window, or open new windows for each topic and run searches concurrently. </p>
<p><b>CW</b>: Could you talk a bit more about &#8220;complex search behaviors&#8221;? </p>
<p><b>AS</b>: Peoples&#8217; information seeking behavior can often be long and complex. Imagine a person is looking for information on cars. He conducts one search on one search engine, looks at the results, and tries another search engine, or goes back to the first search engine, and repeats the same search with the same search terms, or he may add or remove some terms (query reformulation). This is called <i>successive searching</i>.</p>
<p>In addition, research shows that people often search for more than one topic during their interaction with a search engine. They may batch their topics due to time constraints or new topics may evolve during their search session. This is called <i>multitasking search</i>. </p>
<p>Both phenomena are examples of more complex behaviors beyond the one-topic, one-search paradigm that most search engines assume. </p>
<p><b>CW</b>: If queries are still so short, what are some of the more successful disambiguation tools used by the search engine? Vivisimo and Clusty offer algorithmically generated groupings and present them as narrowing tools. But last time I was testing these tools with users, the narrowing options were essentially invisible to them. And despite Jakob Nielsen&#8217;s assurance (admittedly in 1999) that longer search boxes produce longer queries, I&#8217;ve never seen it happen. Are there better and worse ways to encourage richer queries from users?</p>
<p><b>AS</b>: So far no commercial tool seems to be effective at helping users on a large scale. Search engines have not used longer boxes, so no one really knows what would happen on a large scale if text boxes were changed. The best way to encourage richer queries is to train users and expect them to put more effort into their search behavior. Search engines need to put more demands on the users. People don&#8217;t understand their own information behaviors, and they don&#8217;t really understand much about search or the web, so they will have to learn. It could take generations. </p>
<p><b>CW</b>: Really? Many folks who revamp search, either by adding Google, Yahoo!, or another vendor, seem to be leaning toward long entry boxes. I&#8217;m thinking about CNN, NY Times.com, CNET.</p>
<p><b>AS</b>: The Google and CNN text boxes may be a little bigger or longer than average, but not substantially longer. How about a structured textbox, like an electronic library catalog interface? How about a textbox that is 3 inches by 3 inches with lots of space for people to express themselves? If you give people a small text box, you&#8217;re probably constraining their expression of their information problem. </p>
<p>People need to feel they should play around with search and experiment. All they can really do at present is squeeze in a few words, press search, and look at a list of websites-the list giving little indication of what the websites mean or how they are ranked. One major problem is that [designers of] search engines tend to think that one technique will do it! What they need to do is test combinations of many techniques, such as clustering, relevance feedback, etc. There is no silver bullet here. </p>
<p><b>CW</b>: When you speak of training users, I&#8217;ve found that very challenging, much more with search than any other Internet paradigm. I&#8217;ve been in a lab with a fellow who has used Google for five years, and he never realized &#8220;cached&#8221; was there until I pointed it out. How can train people who have &#8220;banner blindness&#8221; for most of the page?</p>
<p><b>AS</b>: I think this is a major problem for the web industry. How to train billions of people? Whoever comes up with the best solution for that question may capture huge market share. The paradigm needs to change. Search is challenging and interactive, and maybe a &#8220;game&#8221; paradigm would help.</p>
<p><b>CW</b>: The short-query phenomena is fascinating. In a lab, I have asked people why they typed, say, &#8220;sailboats,&#8221; and they&#8217;ll say, &#8220;Oh, I&#8217;m interested in taking classes next summer when we&#8217;re up at the lake in Michigan,&#8221; yet none of the words made it into their query. Any insights?</p>
<p><b>AS</b>: Our research shows that the most effective search terms are those submitted by the user, from a user&#8217;s interaction with another person about their topic, and terms they identify on the screen from the retrieved output. Stimulating users to talk with someone or thing (agent) about their information problem helps generate terms and look at the results for additional terms. </p>
<p><b>CW</b>: Hey, those sound like classic reference librarian techniques!</p>
<p><b>AS</b>: One area that some web developers are exploring is classic reference librarian techniques. It&#8217;s an obvious area to explore to understand information behavior and how librarians have helped people with their information problems.</p>
<p><b>CW</b>: &#8220;E-sex and E-commerce,&#8221; was referring to a topical shift in searches. Are you continuing to see changes in what people are searching for?</p>
<p><b>AS</b>: I think it&#8217;s important not to assume what &#8220;people are searching for&#8221; means just U.S.-based search engines. There are major differences emerging in search in different global regions. For the more U.S.-based search engines, the topics seem to have stabilized somewhat with business and e-commerce related searches being the largest category, followed by people, places and things, computers, and medical/health. Sex/porn and entertainment is now a smaller proportion of searches. From what I&#8217;ve seen about the Chinese search engines (e.g., Baidu), users are looking for entertainment and gaming. One could say that the Chinese search engine users are where the U.S. users were 5-10 years ago. As more Chinese business information is accessible via Baidu, the search topics may change. Also, currently less than 10% of the Chinese population search the web, so as that number increases, topic may change. </p>
<p><b>CW</b>: There are endless articles these days about search privacy, and Google giving information to the feds. Is the ordinary person on the street worried about that?</p>
<p><b>AS</b>: This is an important area for everyone. If search engines and the web are becoming the primary tool by which people are expected to access information, then privacy and the practices of the government in regulation or companies is crucial. Much like the way we see telephones and TV in the past, as involving privacy, commercial, and government interests. Also, because search is now ubiquitous, politicians will seek to gain political advantage or grounds for industry regulation. Ordinary people should be concerned about how political and commercial information policies will affect their access the web.</p>
<p><b>CW</b>: And you are studying the evolution of human information behavior. How far back are you going? Medieval libraries? Cavemen looking for the right painting?</p>
<p><b>AS</b>: Obviously humans evolved information behaviors before preliterate societies through cave art, etc. Information behaviors evolved to help humans complete and cooperate, as the technologies evolved from cave art to the web. In fact, people may not change their information behaviors, but may have evolved over time to utilize a greater capacity for more complex information behaviors. </p>
<p>The Spink and Currier paper talks about the information behaviors of Darwin, Napoleon, and Casanova-all very effective people at finding and using information. And what we write about Casanova many people have found fascinating!</p>
<p><b>CW</b>: Now you are being cruel! I&#8217;m going to have to renew my library card. Can you predict trends in behavior from your research? What&#8217;s next on the horizon?</p>
<p><b>AS</b>: What&#8217;s next on the horizon is developing an understanding of how human information behaviors have evolved over human history, how they evolve over a person&#8217;s lifetime, how their search interactions develop over time, and how search in the aggregate is evolving over time. In other words, we need more longitudinal studies. </p>
<p><b>CW</b>: Any bits of advice to practitioners about to attack the search tools on their sites? Lessons from web search?</p>
<p><b>AS</b>: A key problem for practitioners is the lack of computer people trained in information and web retrieval, web design, and web usability. There is a lack of good trained people and not many industry consultants who really understand search.  Search is much harder than most people think, and the design of effective search tools is even harder. Practitioners need to really test any search engines they consider buying. Many companies claim that their search engines are effective and the best, but provide little real evidence for their claims. </p>
<p>Be careful of the search engine that promises effectiveness and superiority based on a &#8220;single&#8221; feature, e.g., linking or clustering. There is no silver bullet feature.  We don&#8217;t yet have Search engines that have adopted a more holistic attitude based on a real understanding of search, people&#8217;s information behavior and what is really effective. Whoever takes that path effectively will gain competitive advantage in the marketplace. </p>
<p><morebox><br />
<b>For More Information</b></p>
<p>Spink, A., &#038; Currier, J. (2006). Emerging evolutionary framework for human information behavior. In: A. Spink &#038; C. B. Cole (Eds.), New Directions in Human Information Behavior. Berlin: Springer (pp. 13-31).</p>
<p>Spink, A., &#038; Cole, C. B. (2006). Human information behavior: Integrating diverse approaches and information use. Journal of the American Society for Information Science and Technology, 57(1) 25-35. </p>
<p>Spink, A., &#038; Currier, J. (2006). Toward an evolutionary perspective on human information behavior: An exploratory study. Journal of Documentation, 62(2), 171-193.</morebox></p>
]]></content:encoded>
			<wfw:commentRss>http://boxesandarrows.com/long-tails-and-short-queries/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Four Modes of Seeking Information and How to Design for Them</title>
		<link>http://boxesandarrows.com/four-modes-of-seeking-information-and-how-to-design-for-them/</link>
		<comments>http://boxesandarrows.com/four-modes-of-seeking-information-and-how-to-design-for-them/#comments</comments>
		<pubDate>Tue, 14 Mar 2006 17:19:55 +0000</pubDate>
		<dc:creator>Donna Spencer</dc:creator>
				<category><![CDATA[Big Ideas]]></category>
		<category><![CDATA[Design Principles]]></category>
		<category><![CDATA[Findability]]></category>

		<guid isPermaLink="false">http://boxesandarrows.com/four-modes-of-seeking-information-and-how-to-design-for-them/</guid>
		<description><![CDATA[Information-seeking behavior varies from situation to situation. Donna Maurer explores different ways in which users look for information and offers tactics for accommodating them.]]></description>
				<content:encoded><![CDATA[<pullquote>&#8220;Observe how your users approach information, consider what it means, and design to allow them to achieve what they need.&#8221;</pullquote>
<p><i>I discovered the concepts in this article while preparing material for an introductory information architecture workshop. In the workshop, I thought it important to highlight that one aspect of designing for users was to understand the ways in which they may approach an information task. I was already familiar with the concepts of known-item and exploratory information seeking: they are common in the library and information science literature and are also discussed in <a href="http://www.amazon.com/gp/product/0596000359">Information Architecture for the World Wide Web</a>.</i></p>
<p>In my work on intranets and complex websites, I noticed a range of situations where people didn’t necessarily know what they needed to know. Additionally, when I opened my browser history to look for examples from recently-visited sites, I noticed that the majority of my own time was spent trying to find things that I had already discovered.  These two modes didn&#8217;t fit into the concepts of known-item and exploratory information seeking. I call these “don’t know what you need to know” and re-finding.</p>
<p>I spent a while letting this rattle around my head, talking with IAs and designers, and realized that most only thought in terms of known-item searching. When discussing the other types of tasks, they’d ask with a horrified look, “So how do you design for that?”</p>
<p>Let’s look at the modes of seeking information in some depth and their implications for web design.</p>
<p><strong>1. Known-item</strong><br />
Known-item information seeking is the easiest to understand. In a known-item task, the user:
<ul>
<li>Knows what they want</p>
<li>Knows what words to use to describe it
<li>May have a fairly good understanding of where to start
</ul>
<p>In addition, the user may be happy with the first answer they find (though not always) and the task may not change significantly during the process of finding the answer.</p>
<p>Some examples include finding out whether Katharine Kerr has a new novel, learning about how the CSS color:transparent attribute works, and getting a copy of the travel form. These are all clearly defined, easy to describe, and the starting point is straightforward.</p>
<p>There are a number of design approaches to help with this type of task:
<ul>
<li>Search. This is a particularly good solution: people can articulate what they need and are able to type it into a search box.  As long as the search results show the word in context or show a clear description of results, they are likely to recognise suitable pages from the search results.</p>
<li>A-Z indexes. These are great at supporting this mode, as users are able to articulate the word that they are looking for. As long as the A-Z contains the word the user is thinking of, all they need to do is read down the list and spot the right item.  One way to make sure that the list of terms in an A-Z index matches the words that users think of is to look at the terms used during user research or in the search logs.
<li>Quick links. Links to frequently used items allow easy access to them. Again, the terms in the list must match the users’ terms.
<li>Navigation. Browsing via navigation can support this behavior.  It is most likely to be effective when the user can clearly identify which navigation heading to choose from.
</ul>
<p>For this mode, it is important that people are able to answer their question quickly.</p>
<p><strong>2. Exploratory</strong><br />
In an exploratory task, people have some idea of what they need to know. However, they may or may not know how to articulate it and, if they can, may not yet know the right words to use. They may not know where to start to look. They will usually recognise when they have found the right answer, but may not know whether they have found enough information.</p>
<p>In this mode, the information need will almost certainly change as they discover information and learn, and the gap between their current knowledge and their target knowledge narrows.</p>
<p>As an example, a few years ago I was looking for information on the cognitive mechanisms that allow people to navigate the physical world (I was comparing the concept of online and physical navigation). I knew what I was after, but couldn’t describe it (‘navigation’ in a search engine would return results for web navigation). I had no idea where to start. I tried a number of places and didn’t succeed at all.  (Six months later I stumbled across some wayfinding papers and realised that was the term I needed). </p>
<p>Other examples of exploratory tasks include looking for history on the technique of card sorting, finding examples of sites with complex forms laid out using CSS, and finding music I like. </p>
<p>The first challenge can be getting the user to a good starting point (this was the main problem in the navigation example). This is less of a problem on an intranet as staff may only have one place to explore. Portal sites, subject-based directories, or sites with a wide range of content (such as <a href="http://en.wikipedia.org/wiki/">Wikipedia</a>) can provide avenues to follow on the open Web.</p>
<p>Design approaches for this mode include:
<ul>
<li>Navigation. The most successful design solution will be browse, via navigation of all types. Browsing allows people to take some chances and follow a path, exploring, discovering, and learning as they go.  Users may go deeper or broader in a hierarchy, or to related information.</p>
<li>Related information.  Related links may be created from a list of related topics, a manually created list of relevant pages, or lists based on items purchased or recommended by other users. Contextual links may also be included in the body of the content.
<li>Search. Search can be useful for exploratory tasks, but can be problematic due to the user’s inability to articulate what they are after. An initial search can help the user to learn about the domain and get some ideas for keywords. It can also be useful to provide synonyms for the search term as they may help the user to better articulate their query.<br />
For this mode, it is critical that there are always avenues for exploration and that the visitor never reaches a dead end.
</ul>
<p><strong>3. Don’t know what you need to know</strong><br />
The key concept behind this mode is that people often don&#8217;t know exactly what they need to know.  They may think they need one thing but need another; or, they may be looking at a website without a specific goal in mind.</p>
<p>This mode of seeking information occurs in a number of situations:
<ul>
<li>Complex domains such as legal, policy, or financial.  For example, a staff member may want to know how many weeks maternity leave they are entitled to, but may need to know the conditions surrounding that leave.  We should read the terms and conditions of new products and services as there maybe important restrictions, but they are too often buried in legal garble that we don&#8217;t read.</p>
<li>Any time we wish to persuade the user.  For example, we would love people to know more about information architecture and usability, but they often don&#8217;t know that the concepts even exist.  They may think they want to know how to make an accessible nested fly-out menu; we think they need to know more about organising the content properly.
<li>Unknown domains.  For example, when someone is told by friends that he or she should check out a new service, product or website, but does not yet know why he or she would want to know about it.
<li>Keeping up to date. People often want to make sure they keep up to date with what is happening within an industry or topic, but are not looking for a specific answer.
</ul>
<p>The challenge is providing an answer while exposing people to the necessary information, thus showing what they may need to know. This can be achieved by:
<ul>
<li>Straightforward answers. Simple, concise answers allow people to have their initial information need met. For example, in the four situations above the websites could include a summary of the maternity leave benefit, the key issues of concern in the terms and conditions, an outline of the benefits of the new website or service, and a list of latest releases respectively.</p>
<li>More detailed information. Make more detailed information easily available.  This may take the form of related links or contextual links in the body of the content.
</ul>
<p>The solutions allow people the satisfaction of getting an answer and then the opportunity to get additional information.</p>
<p><strong>4. Re-finding</strong><br />
This mode is relatively straightforward—people looking for things they have already seen.  They may remember exactly where it is, remember what site it was on, or have little idea about where it was.  A lot of my personal information seeking is hunting down information I have already seen.  I don&#8217;t know how prevalent this is, but discussions with others indicate that I am not alone.</p>
<p>Design solutions can be active (where the user takes explicit action to remember an item) or passive (where the user takes no action but items are remembered).</p>
<p>Active solutions exist on many web sites: wishlists (<a href="http://amazon.com">amazon.com</a>), &#8220;save for later&#8221; (<a href="http://emusic.com/">emusic</a>), and favorites (<a href="http://pandora.com">Pandora</a>).  These solutions work well but require a conscious effort from the user, who needs to know they will want to return to an item in the future.  <a href="http://del.icio.us">Del.icio.us</a> is another example of an active solution for the web as a whole.</p>
<p>A good passive solution allows users to see items they have seen before, order them by frequency of use, easily get to the content, and the information within it persists over time (longer than the current session).</p>
<p>Domains where passive solutions offer value include the following:
<ul>
<li>Shopping sites. Users may look at a number of products and may comparison shop before purchasing (e.g. <a href="http://target.com/">Target</a>, <a href="http://drugstore.com/">drugstore.com</a>, <a href="http://www.anthropologie.com">Anthropologie</a>, <a href="http://classygroundcovers.com">Classy Groundcovers</a>, <a href="http://www.expansys.ca/">Expansys</a>).</p>
<li>Weblogs.  Readers may revisit favorite posts and watch comments on a post.
<li>Article sites. Sites like Boxes and Arrows may have readers returning to their favorite articles frequently.
<li>Support sites. Readers need to return to the same help topics.
<li>Real estate sites. Potential buyers look at their favorite house over and over.
<li>Complex search facilities. Users may wish to retain their search, modify it, or rerun it.
</ul>
<pullquote>&#8220;The most important issue is not whether you notice a mode of seeking information that fits into one of these categories, but that a range of modes exist.&#8221;</pullquote>
<p><strong>Identifying the modes</strong><br />
Once you understand the modes, examples are easy to spot during user research.</p>
<p>Known-items show up in heavy use of search with accurate keywords, when users can easily list what they need from the site and support e-mail will ask for specific content.</p>
<p>Exploratory information seeking shows up in search when vague phrases or repeated searches for similar keywords are used; when users express that they are researching, looking for background information, or “finding out about” something; and when support e-mails ask for general information.</p>
<p>“Don&#8217;t know what you need to know” is a little harder to identify.  In interviews, users may express that they just want to keep up with things. It may also be clear that users do not have sufficient background knowledge or have not read information they should have. You can identify gaps in content by walking through the content, acting out a scenario from the user perspective, and checking that sufficient information is available. </p>
<p>Re-finding is easy to identify if your site has user registration and the logs show what pages people visit.  You can also look at the number of items in wish lists.</p>
<p><strong>Conclusion</strong><br />
The most important issue is not whether you notice a mode of seeking information that fits into one of these categories, but that a range of modes exist. Observe how your users approach information, consider what it means, and design to allow them to achieve what they need.</p>
<p>Note: Thank you to IAI members for suggestions for sites that offer navigation for the re-finding task.</p>
]]></content:encoded>
			<wfw:commentRss>http://boxesandarrows.com/four-modes-of-seeking-information-and-how-to-design-for-them/feed/</wfw:commentRss>
		<slash:comments>29</slash:comments>
		</item>
		<item>
		<title>The ABCs of the BBC: A Case Study and Checklist</title>
		<link>http://boxesandarrows.com/the-abcs-of-the-bbc-a-case-study-and-checklist/</link>
		<comments>http://boxesandarrows.com/the-abcs-of-the-bbc-a-case-study-and-checklist/#comments</comments>
		<pubDate>Mon, 19 Dec 2005 06:57:37 +0000</pubDate>
		<dc:creator>Helen Lippell</dc:creator>
				<category><![CDATA[Case Studies]]></category>
		<category><![CDATA[Findability]]></category>
		<category><![CDATA[Interfaces]]></category>

		<guid isPermaLink="false">http://boxesandarrows.com/the-abcs-of-the-bbc-a-case-study-and-checklist/</guid>
		<description><![CDATA[Can A-Z indexes hold their own against other popular navigational elements like search and sitemaps? Helen Lippell guides us through how the BBC online, a site with over two million pages, handled their own A-Z index.]]></description>
				<content:encoded><![CDATA[<pullquote>&#8220;We felt the need to make the A-Z more like a supermarket (comprehensive and well-organized) and less like a junk shop (random and gems buried amidst the clutter).&#8221;</pullquote>
<p>It is said that a tenth century Grand Vizier of Persia took his library of 117,000 volumes with him whenever he toured his kingdom. He trained his caravan of camels to walk in alphabetical order so that he could always find what he wanted. Luckily, these days it is somewhat easier to organize and present your content in alphabetical order.</p>
<p>A-Z indexes are sometimes seen as the less desirable counterpart to other navigational elements such as sitemaps and, especially, search. However, A-Z indexes can be a valuable secondary navigation tool, especially for large sites with a lot of granular content.</p>
<p>Because there are already a number of excellent articles online that talk about the value of A-Zs, I’d like to outline instead what we did at bbc.co.uk in the first half of 2005: namely, repositioning the site index as a viable secondary navigation tool. I’ll also offer a checklist of eight areas to consider when thinking about creating an A-Z site index. The list has already proved useful in advising <span class="caps"><span class="caps"><span class="caps"><span class="caps">BBC</span></span></span></span> colleagues with no background in indexing or information architecture on how to painlessly create local A-Zs for their particular areas of content.</p>
<p><span class="subhead">Project background</span><br />The project to overhaul the bbc.co.uk index began in late 2004. There was a push to change the visual design to make the user experience more aesthetically pleasing, and a need to address known usability issues with the existing A-Z index, including a lack of understanding of the multi-page format and lack of awareness of the special page for numeric entries. Unlike many web design challenges, the goal of the pages was to send the user off somewhere else as quickly as possible rather than keep them on the page.</p>
<p>We also needed to tackle the hybridized nature of the index, which was part alphabetic listing, part directory, and did not serve either of those models well. Research had shown that users had confused expectations about and varied success in using the index.</p>
<p>The huge size of bbc.co.uk precluded providing a granular index that could support comprehensively categorized headings. However, a more achievable goal was to enable most pages on the sprawling site to be reached within three clicks of a link in the index. One of the other key goals for the project was to create and communicate editorial guidelines to the rest of <span class="caps"><span class="caps"><span class="caps"><span class="caps">BBC</span></span></span></span> New Media. For the first time, we defined a definite level of granularity beyond which content would not be considered for inclusion. Our site has over two million pages; for this reason, we decided not to consider single pages of content for inclusion in the index. This was an easy guideline for stakeholders and content authors to grasp.</p>
<p><span class="subhead"> “More supermarket, less junk shop:” Defining design objectives</span><br />There is more to index development than sorting out words and phrases on a page. The success of the <span class="caps"><span class="caps"><span class="caps"><span class="caps">BBC</span></span></span></span> project also hinged on using proven user-centered design techniques. This included developing personas, using a creative brief, and working with a visual designer. The ultimate aim was to create an attractive yet functional page.</p>
<p><strong>Personas</strong><br />The project team developed two personas for the project. The secondary persona was someone who would never use the A-Z, except as a last resort: Stephen, who is 22 years old, from London, and who most definitely has a search mentality. The priority, however, was our primary persona (based loosely on a real participant from the first round of testing), who would help the team imagine how a real person would use what we were building. We named her Sheila.</p>
<p>Sheila is 59, retired, and lives in Newcastle. Her web experience is mainly limited to genealogy and browsing kids’ content on bbc.co.uk with her grandchildren. She has used email and has bought online, but without great confidence. She doesn’t really like searching, and prefers to scan a list of links even if it means scrolling.</p>
<p><strong>Design concepts</strong><br />A creative brief document cemented the business objectives and ensured that the tone, look and feel, and personality of the index would meet branding needs. As simple as the pages are, with extensive use of white space to aid scannability and legibility, they also look friendlier just from having a little color. The blue also means the A-Z is in sync with the wider bbc.co.uk brand color, as chiefly seen on the <a href="http://www.bbc.co.uk">homepage</a>.</p>
<p><img alt="Lippellhelen_img01" height="413" src="http://www.boxesandarrows.com/files/banda/the_abcs_of_the_bbc_a_case_study_and_checklist/LippellHelen_img01.jpg" width="438" /><br /><em>Top of a page of the A-Z index, showing the use of blue to match the bbc.co.uk brand color</em></p>
<p>When thinking about personality, we realized that there was nothing different about the A-Z as compared to any other navigation tools on the site. Everything should be “warmly informative” or “approachably organized,” wherever you’re browsing. Specifically, we keenly felt the need to make the A-Z more like a supermarket (comprehensive and well-organized) and less like a junk shop (random and gems buried amidst the clutter). Not all junk shops are like that, of course, but it was hard to see the bbc.co.uk A-Z as one of the good ones before the project!</p>
<p>Two further enhancements are worth mentioning. Statistical analysis had shown that many users of the previous version of the index never visited any page other than “A,” because that was the first one they would come to on clicking the A-Z link from somewhere else on the site. We decided to develop a new front page that had no alphabetic entries (other than Popular Links) but had links to each letter page instead.</p>
<p>Secondly, we introduced new “filters” for certain link types, such as those to web pages about <span class="caps"><span class="caps"><span class="caps"><span class="caps">BBC</span></span></span></span> shows, some of which also have video or audio content. These pages are in addition to the 27 main letter and number pages, which show only TV or radio program page links. The special filtered pages are intended to aid browsing for A-Z users coming to the index from virtually anywhere on the site, and also to be the first destination from broadcast-specific areas of the site (most notably, from <a href="http://www.bbc.co.uk/tv/">www.bbc.co.uk/tv</a>).</p>
<p><img alt="Lippellhelen_img02" height="135" src="http://www.boxesandarrows.com/files/banda/the_abcs_of_the_bbc_a_case_study_and_checklist/LippellHelen_img02.jpg" width="500" /><br /><em>Part of the bbc.co.uk/tv portal page, with ringed link to TV program-only part of the A-Z index</em></p>
<p><strong>Competitor analysis</strong><br />The team looked at around 20 other indexes from both elsewhere on bbc.co.uk and on the wider web. As well as recording subjective impressions of how well these indexes worked, we compared the indexes against a set of criteria such as intra-letter navigation, link readability, use of icons, use of color and/or whitespace, quality of link labeling, relevancy of content, and availability of other navigation tools (e.g., sitemaps).</p>
<p>From the set of bbc.co.uk indexes surveyed, <span class="caps"><span class="caps"><span class="caps"><span class="caps">BBC</span></span></span></span> Health’s <a href="http://www.bbc.co.uk/health/conditions">A-Z of illnesses and conditions</a> stood out, particularly for offering different approaches to browsing and for clean, attractive design. Of the non-BBC indexes, the <a href="http://www.aaai.org/AITopics/html/a2z.html">American Association for Artificial Intelligence’s index</a> was strong on cross-referencing (“See” and “See Also” references). It was also very usable due to its mix of scientific and non-scientific terminology to help support the site’s goal of making complex ideas accessible to a general audience.</p>
<p>From this design and research work I realized that some of the nuances of compiling an A-Z could be distilled into a short checklist for non-specialists and specialists alike. The list is illustrated with examples from throughout the bbc.co.uk A-Z index. I hope that the areas covered below are general enough to deal with scenarios from all kinds of knowledge domains.</p>
<p><span class="subhead">Eight-point checklist for creating terrific A-Z indexes</a></p>
<p><strong>1. Know your audience</strong><br />It’s vital to understand the way that your audience interacts with your website and your index. There are many ways of doing this; three that can be particularly useful are search log analysis, persona development, and user testing.</p>
<p>Search logs have been invaluable throughout the lifecycle of the A-Z to highlight the areas of interest to users. They also shed light on the language used by people trying to find things on the site. Even though the A-Z is a browse tool and search is a search tool, don’t ignore the common goals of their respective users: finding stuff easily.</p>
<p>As outlined earlier in the article, creating personas helped guide the development of both visual design and information design for the project. They’re a fun, powerful design technique that helps provide a framework for a successful project.</p>
<p>More tangibly, do user testing on new designs or newly created indexes, ideally at interim stages and not just at the end of the project. There were two rounds of user testing in the <span class="caps"><span class="caps"><span class="caps"><span class="caps">BBC</span></span></span></span> project, one with participants recruited by an external agency, and the second with <span class="caps"><span class="caps"><span class="caps"><span class="caps">BBC</span></span></span></span> staff who weren’t involved in the project and weren’t in the wider user experience design team.</p>
<p><strong>2. Show your numbers</strong><br />Don’t make your users guess: even if there’s only one non-alphabetic entry, show that it exists. Many indexes fudge the challenge of entries that begin with numerals by shoving them, for example, in under ”A” or “Z.” Chances are, the entries will only ever be found by serendipitous browsing or lucky guessing.</p>
<p>We used the label “0-9” because there were no entries beginning with a punctuation character. It’s not ideal because, as one participant in testing suggested, users wondered where all the numbers (1, 2, 3, etc.) would be. That said, nobody else questioned the label, so it seemed a good solution. I’ve also seen the hash symbol (“#”) used to encompass both numeric and punctuation character entries, as well as “num” for numeric-only entries.</p>
<p>Whatever it is labeled, make the numbers section or page prominent by placing it in front of the letter “A.” This is a convention in computer books that index technical concepts. In the row of letter blocks that appears at the top of every bbc.co.uk A-Z index page, “0-9” resembles the double door for Christmas Day on an advent calendar. This may seem to give it undue prominence, but doing so gives the page a better chance of being spotted and used, illustrated in image 3:</p>
<p><img alt="Lippellhelen_img03" height="172" src="http://www.boxesandarrows.com/files/banda/the_abcs_of_the_bbc_a_case_study_and_checklist/LippellHelen_img03.jpg" width="438" /><br /><em>Top of an A-Z index page, showing double size box for the numbers page</em></p>
<p>The site indexes of technology companies can be good sources of inspiration in dealing with non-standard entries; a good example is http://java.sun.com/a-z/.</p>
<p><strong>3. Acknowledge articles</strong><br />The question of how to deal with entries that start with “the,” “a,” or “an” became important for bbc.co.uk because of the sheer volume of program titles that needed to be added to the index. Eventually we decided to double-post entries under both the first letter of the article word and whatever letter the next word started with, hence entries for both “The Apprentice” and “Apprentice, The”.</p>
<p>As with the previous tip, why make users stop to think about how the indexer might choose to see the content? The beauty of an A-Z over a directory or a sitemap is that different mental models can be supported and the same thing can appear in more than one place.  This gives it an advantage over, say, the Grand Vizier’s camels.</p>
<p><strong>4. Include synonyms</strong><br />Synonyms, in the form of “See: XX” references, appear throughout the bbc.co.uk index. As in the traditional thesaurus, they are used to show equivalence between a word or phrase and its preferred term. Synonyms in an A-Z add richness to the list of entries–and can often allow you to speak your users’ language without losing the ability to call entries by their correct names. Equally, synonyms play a role in educating users as to what those correct names are.</p>
<p>For bbc.co.uk, many synonyms are used in the A-Z to provide alternative access paths to branded content. For example, the radio station Five Live is always written with a word rather than a number, but a user scrolling through the numbers page will see the pointer:</p>
<pre><code>5 Live:&amp;lt;br /&amp;gt;    See: Five Live homepage</code></pre>
<p>Synonyms are often abbreviations or acronyms of phrases, as in this entry for the well-known cricket commentary program:</p>
<pre><code>TMS:&amp;lt;br /&amp;gt;        See: Test Match Special</code></pre>
<p>Synonyms can also help users who think in terms of categories or subject areas (rather than those looking for a known item). They may not know exactly what content they want, but can be directed to something appropriate by considerate use of “See:” references. For instance:</p>
<pre><code>Family History&amp;lt;br /&amp;gt;    See: Who Do You Think You Are?</code></pre>
<p><strong>5. Properly index proper names</strong><br />People should be indexed by surname rather than first name, as per book indexing convention. There are of course some exceptions; for example, the names of monarchs (“Charles I”), certain celebrities (“Mel B”), or people from cultures where the surname appears before the first name (“Mao Tse-Tung”).</p>
<p>(The bbc.co.uk A-Z does not contain any entries for people’s names, unless they are part of a site name or program brand. Thus, there are entries for “The Jeremy Vine Show” under both “T” and “J,” but not “V.”)</p>
<p>Much more detailed information about name indexing can be found at <a href="http://www.anindexer.com/about/name/nameindex.html">Martin Tulic’s site</a>.</p>
<p><strong>6. Consider your cross-references</strong><br />In general, the bbc.co.uk A-Z avoids excessive cross-referencing, which could make already long pages less usable and less attractive to casual browsers. However, bringing closely-related concepts together can add value to the index and promote content in different places. Cross-references are shown as “See Also:” pointers in entries, as in this example from bbc.co.uk, where a country name is linked to content about the languages that are spoken there:</p>
<p>Sri Lanka</p>
<p>    Sri Lanka (Country Profile)</p>
<p>    Sri Lanka (Cricket)</p>
<p>    See Also: Sinhala (World Service)</p>
<p>    See Also: Tamil (World Service)</p>
</p>
<p><strong>7. Use qualifiers and extra information</strong><br />Besides synonyms and cross-references, there are other ways to make your index more user-friendly. Qualifiers are extra bits of information in parentheses, attached to index entries, often for clarifying concepts. For example, the bbc.co.uk A-Z qualifies dance as “(Performing Art)” in order to differentiate it from dance music, which is also covered in depth on the site.</p>
<p>Qualifiers can also be used on large sites where a subject is covered in more than one place, a particular issue for bbc.co.uk. The entry for “Comedy” is:</p>
<pre><code>Comedy&amp;lt;br /&amp;gt;     Comedy homepage&amp;lt;br /&amp;gt;     Comedy (BBC 7)&amp;lt;br /&amp;gt;     Comedy (BBC Two)&amp;lt;br /&amp;gt;     Comedy (BBC Film Network)&amp;lt;br /&amp;gt;     Comedy (Radio 4)</code></pre>
<p>It is also possible to supply extra information in the form of what would be called scope notes in a thesaurus. These could be useful in a website index where an entry for a new or growing brand, or an unusual concept, might benefit from further detail. For example:</p>
<pre><code>BBCi – Information about interactive TV</code></pre>
<p><strong>8. Take pride in the index</strong><br />Dealing with everything mentioned in the other seven tips will give your index a fighting chance of being successful. None of it will matter, however, if users cannot find it.</p>
<p>Make the A-Z index available from all parts of your site if you can, preferably linking to it in a consistent, prominent place on every page. This might be in a toolbar or as part of the navigation. Whatever you do, don’t follow the example of some sites that put the link to the A-Z at the bottom of the page, in tiny size 1 font, with other links that very few people (apart from lawyers) ever see, let alone click on.</p>
<p>Take pride in your index, and be like the owners of this second-hand bookshop in London, who clearly wanted customers to know they cared about organizing their stock!</p>
<p><img alt="Lippellhelen_img04" height="380" src="http://www.boxesandarrows.com/files/banda/the_abcs_of_the_bbc_a_case_study_and_checklist/LippellHelen_img04.jpg" width="479" /><br /><em>Sign from a London bookshop promoting beautifully organized stock</em></p>
<p><morebox><strong>For more information</strong></p>
<p>There is a wealth of information on the web about indexes of all sorts, not just A-Zs.</p>
<p>The following are good places to start:<br /><a href="http://digital-web.com/articles/a_z_indexes_site_searching/">A-Z Indexes to Enhance Site Searching</a><br /><a href="http://www.sitepoint.com/article/a-z-indexes">A-Z Website Indexes Explained</a><br /><a href="http://www.iawiki.net/WebSiteIndexes">IA Wiki WebSiteIndexes</a><br /><a  href="http://www.ics.ltsn.ac.uk/books/reviewed/040.htm">Beyond Book Indexing</a></p>
<p>There are many books, especially from the field of library science, that cover the concepts and practice of indexing. I have found the <a href="http://eu.wiley.com/WileyCDA/WileyTitle/productCd-0471014494.html">Art of Indexing</a>, by Larry Bonura, a useful introduction.</p>
<p>Lastly, <a href="http://www.bl.uk/acatalog/Catalogue_ISBN_0712347291_139.html">Indexers and Indexes in Fact &#38; Fiction</a>, edited by Hazel K. Bell, is a reminder that indexing isn’t just about dry, meticulous text analysis. The book contains dozens of idiosyncratic, political, useless, or just plainly wrong-headed indexes.</morebox></p>
]]></content:encoded>
			<wfw:commentRss>http://boxesandarrows.com/the-abcs-of-the-bbc-a-case-study-and-checklist/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Developing and Creatively Leveraging Hierarchical Metadata and Taxonomy</title>
		<link>http://boxesandarrows.com/developing-and-creatively-leveraging-hierarchical-metadata-and-taxonomy/</link>
		<comments>http://boxesandarrows.com/developing-and-creatively-leveraging-hierarchical-metadata-and-taxonomy/#comments</comments>
		<pubDate>Sun, 23 May 2004 04:27:58 +0000</pubDate>
		<dc:creator>Christian Ricci</dc:creator>
				<category><![CDATA[Findability]]></category>
		<category><![CDATA[Methods]]></category>
		<category><![CDATA[Process and Methods]]></category>
		<category><![CDATA[Special topic: Search and Metadata]]></category>

		<guid isPermaLink="false">http://boxesandarrows.com/developing-and-creatively-leveraging-hierarchical-metadata-and-taxonomy/</guid>
		<description><![CDATA[In content metadata and hierarchies, you will often find a goldmine of implicit and explicit data that you can leverage to <i>creatively</i> contextualize content. After a brief introduction on taxonomy and metadata, this article focuses on finding and utilizing such relationships in hierarchies.]]></description>
				<content:encoded><![CDATA[<p>When confronted with projects requiring content, document or knowledge management, and presentation, more likely than not, the information architect will be expected to lead or contribute to development of the content classification requirements. And we don&#8217;t classify our content without reason.</p>
<pullquote>&#8220;In content metadata and hierarchies, you will often find a goldmine of implicit and explicit data that you can leverage to <i>creatively</i> contextualize content.&#8221;</pullquote>
<p>As site creators, it would be time-consuming, expensive, and contentious to develop and maintain the necessary infrastructure and processes to manage unorganized content. As site users, it would be maddening to try to sift through the links in Yahoo&#8217;s directory trees if contributors and reviewers hadn&#8217;t organized them ahead of time. By organizing content for presentation according to its metadata, we can &#8220;contextualize&#8221; for potential users.</p>
<p>In content metadata and hierarchies, however, you will often find a goldmine of implicit and explicit data that you can leverage to <i>creatively</i> contextualize content. Following a brief introduction on taxonomy and metadata (what I call content classification requirements), this article will focus on finding and utilizing such relationships in hierarchies.</p>
<p><span class="subhead">Content classification requirements: taxonomy and metadata</span><br />
&#8220;Taxonomy&#8221; is a terribly overused term these days. Bob Boiko, in the book <i>Content Management Bible</i>, goes so far as to call it &#8220;trendy.&#8221; Specifically, taxonomy is a hierarchical structure for the classification or organization of data. Historically used by biologists to classify plants or animals according to a set of natural relationships, in content management and information architecture, we tend to leverage taxonomies as a tool for organizing content (For additional information, see Christina Wodtke&#8217;s interview with Samantha Bailey <a href=" http://www.boxesandarrows.com/view/unraveling_the_mysteries_of_metadata_and_taxonomies">elsewhere on Boxes and Arrows</a>).</p>
<p>Metadata (data about data) describes an asset and provides us with a meaningful set of attributes that we can use to further classify or consume content. While much metadata is flat or one-dimensional in nature (e.g., size or weight), some of it is hierarchical (e.g., taxonomies), making the definition and distinction between metadata and taxonomy vague and fuzzy.</p>
<p>Collectively, I tend to manage taxonomy and metadata needs as simply content classification <i>requirements</i>; taxonomy as a means of organizing content and metadata as method of further describing it.</p>
<p>If your task, for example, is to classify independent musicians for a website and database, you may choose a taxonomy that organizes the artists by musical style and then create metadata to describe the members of the band, the year of inception, record label, discography, and geography.</p>
<p><center><a href="http://www.boxesandarrows.com/files/banda/developing_and_creatively_leveraging_hierarchical_metadata_and_taxonomy/ricci_01.gif"><img src="http://www-boxesandarrows-com.zippykid.netdna-cdn.com/files/banda/developing_and_creatively_leveraging_hierarchical_metadata_and_taxonomy/ricci_01-thumb.gif" width="400" height="199" border="0" /></a><br />
Figure 1: Sample taxonomy for classifying music and musicians (Click to enlarge).</center></p>
<p>Geography is particularly interesting as it has the potential to be hierarchical metadata itself. Let&#8217;s say your database expands to include musicians from around the world. Your hierarchy of geography could include country, state or providence, region and/or city. That&#8217;s a whole new hierarchy, perhaps even a taxonomy. Geography, by itself, has become an effective and alternative means of organizing your content. </p>
<p><center><a href="http://www.boxesandarrows.com/files/banda/developing_and_creatively_leveraging_hierarchical_metadata_and_taxonomy/ricci_02.gif"><img src="http://www-boxesandarrows-com.zippykid.netdna-cdn.com/files/banda/developing_and_creatively_leveraging_hierarchical_metadata_and_taxonomy/ricci_02-thumb.gif" width="309" height="201" border="0" /></a><br />
Figure 2: Sample geography taxonomy</center></p>
<p><span class="subhead">How to develop content classification requirements</span><br />
When developing a content classification strategy, it&#8217;s important that you know your needs, your application, and the technical limitations of your software infrastructure and content producers. It&#8217;s quite possible that your content management and delivery needs will tax the capabilities and APIs of your content management tools. It&#8217;s also quite possible that you will severely overtax the abilities and/or patience of your content producers (see sidebar). If there&#8217;s development to be done or sacrifices to be made, keep these constraints in mind as you design the application.</p>
<p>There&#8217;s a very good chance that you will need to revisit and revise your content classification strategy periodically. Life changes, business changes. It is more important that you plan for and design a mechanism and set of policies for easy adaptation, than waste countless hours fine-tuning your taxonomy here and now.</p>
<p>Therefore, here are my guidelines for development of your content classification requirements:
<ul>
<li><b>Address governance as early as possible in the design cycle. </b><br />
How will your content classification get revised and extended? How frequently? What person or team will be responsible for the maintenance of your metadata? How will you settle disputes? </li>
<li><b>Identify the scope. </b><br />
Are you organizing content for creation? Consumption? Both?  A specific department?  Some combination of departments?  Intranet?  Extranet?</p>
<p>Admittedly, it&#8217;s important that you not rule out the eventual integration of this work with other initiatives and parts of your business but consider the value of bounding your project.  Scope creep is a serious risk as you begin cataloging and classifying your content.  Instead of solving all the organization&#8217;s problems right now, invest time and energy in defining how your classification requirements can and will change over time. </li>
<li><b>How does the scope of this application relate to your business organization. </b><br />
Are you striving for the development of an enterprise taxonomy? Enterprise taxonomies are a popular concept right now, especially in organizations initiating development of 3rd and 4th generation intranets. An enterprise taxonomy is typically perceived as a single, monolithic, corporation-wide, structure for the classification of all things related to your business. </p>
<p>Unfortunately, development of an enterprise taxonomy requires the careful coordination, and cooperation, of departments within your organization. Will you be able to coordinate the efforts, language and needs of your IT and sales departments? If so, kudos to you. If not, welcome to the real world.</p>
<p>Disparate divisions within an organization will often use different terms when referring to the same thing. This means, more often than not, the scope of your content classification requirements will be constrained to some subset of the organization or to a particular business process. </li>
<li><b>Catalog your content. </b><br />
The exercise of cataloging your content can be very informative. A lot of guidance, limitations, and input can be achieved during this process. More often than not, your classification requirements will vary by content class or type. </li>
<li><b>Focus on developing good criteria for the definition and extension of your metadata and taxonomy. </b><br />
Good rules will enable those tasked with management to respond quickly and with reason to requests for adaptation. </li>
</ul>
<p><span class="subhead">Organizing content for management and delivery</span><br />
So now we can build a strategy for classifying the content we hope to manage and we have attributes. Now, we need a method for presenting our content. Strategies for getting content from a management infrastructure to the delivery framework (internet, intranet, extranet, client-server application, XML-web services, etc.) vary. Here are two common techniques:</p>
<p><b>1.  Universal Hierarchy</b><br />
A single hierarchy could be used to store and deliver content. When content contributors utilize the content management system, they add, remove, and manage content in a structure that closely resembles the navigation and hierarchy of the delivery framework (your website or application). The navigation structure <i>is</i> your taxonomy.</p>
<p><center><a href="http://www.boxesandarrows.com/files/banda/developing_and_creatively_leveraging_hierarchical_metadata_and_taxonomy/ricci_03.gif"><img src="http://www-boxesandarrows-com.zippykid.netdna-cdn.com/files/banda/developing_and_creatively_leveraging_hierarchical_metadata_and_taxonomy/ricci_03-thumb.gif" width="400" height="157" border="0" /></a><br />
Figure 3: Organizing content for delivery by Universal Hierarchy (Click to enlarge)</center></p>
<p>This method is conceptually simple and makes it quite easy to dynamically build your navigation from knowledge of this hierarchy. However, this model does have drawbacks:
<ul>
<li>Every time you reorganize the website, the organization of content in your management application shifts. Admittedly, this isn&#8217;t much of a drawback if you&#8217;re managing content for one moderately sized site or if your team of contributors is small.</li>
<li>It is difficult to reuse content in this structure. If you hope to reuse assets throughout your website, where are they organized in this structure?</li>
<li>In an environment with many contributors and diverse security requirements, organizing content (in the management application) in another way, say by contributor or by department, may be more intuitive.</li>
</ul>
<p><b>2. Content Mapping</b><br />
A more robust, albeit more complex, method of managing content is to maintain structures and metadata in the content management application that is independent of the delivery system&#8217;s organization (navigation).</p>
<p><center><a href="http://www.boxesandarrows.com/files/banda/developing_and_creatively_leveraging_hierarchical_metadata_and_taxonomy/ricci_04.gif"><img src="http://www-boxesandarrows-com.zippykid.netdna-cdn.com/files/banda/developing_and_creatively_leveraging_hierarchical_metadata_and_taxonomy/ricci_04-thumb.gif" width="400" height="168" border="0" /></a><br />
Figure 4: Organizing content for delivery by Content Mapping (Click to enlarge)</center></p>
<p>Content is organized, at the source, as may be required by your security, workflow, or organizational needs. Perhaps your data lives in a content management system or database where different organizational mechanisms exist.  Unfortunately, oftentimes the navigation for your consuming application (the presentation framework) is managed by some other means.</p>
<p>By some rule or algorithm, leveraging your content classification data, material gets &#8220;mapped&#8221; to the presentation framework. See the example below for an application of this model.</p>
<p>This model is rich with possibilities:
<ul>
<li>There may be more than one way to organize content (think: content reuse). Given the same set of content, same set of classification criteria, but multiple algorithms, we can now build a delivery framework that allows for many methods of organization. </li>
<li>You no longer need to reorganize your content management application to change the delivery application. Just the algorithms (mappings) change. </li>
</ul>
<p>However:
<ul>
<li>If you hope to build your navigation dynamically, often you&#8217;ll need to build a tool or alternate hierarchy. You may not find much value in the content&#8217;s taxonomy. </li>
<li>Content, in your management environment, may be orphaned in your presentation framework if there are no rules mapping to an accessible part of the site. </li>
<li>Parts of the site may only be sparsely populated. It may not be readily obvious that you are creating gaps (with little or no content) in your site. </li>
</ul>
<p>While powerful, this technique can be difficult to administer without having a fairly comprehensive understanding of the site design and algorithms for &#8220;mapping.&#8221;</p>
<p><span class="subhead">Creative contextualization of content using hierarchical metadata or taxonomy</span><br />
Assuming there are hierarchical structures within your content classification system, there is a very good chance that valuable information exists in the hierarchy. By taking advantage of relationships within your hierarchical metadata structures, richer algorithms may be developed for your content delivery framework.</p>
<p>Lets identify some of these relationships and how you can leverage them:</p>
<ul>
<li><b>Ancestors</b><br />
In the example below, North America is an ancestor of America, which is an ancestor of West (and Washington &#8211; WA). Ancestors can be valuable because content classified under an ancestor node may have relevance (albeit with less specificity) to child nodes.</p>
<p><i>If you are looking at content classified under America, it&#8217;s quite possible that there is relevance in the information classified under North America.</i></li>
<li><b>Descendents</b><br />
In the example below, Canada is a descendent of North America. Descendents have value because content classified under a child or descendent node may have greater specificity. This content provides consumers of your delivery framework (users of your website or application) with a means of realizing granularity in the data. </p>
<p><i>A website visitor reading about your corporation&#8217;s news and events relevant to North America may appreciate that there is new specific to Canada or California.</i></li>
<p><center><a href="http://www.boxesandarrows.com/files/banda/developing_and_creatively_leveraging_hierarchical_metadata_and_taxonomy/ricci_05.gif"><img src="http://www-boxesandarrows-com.zippykid.netdna-cdn.com/files/banda/developing_and_creatively_leveraging_hierarchical_metadata_and_taxonomy/ricci_05-thumb.gif" width="400" height="220" border="0" /></a><br />
Figure 5: Nested sets within a geography taxonomy (Click to enlarge)</center></p>
<li><b>Nested Sets</b><br />
Nested sets are the union of data classified under a particular node and all of its descendents. The example, above, shows three sets. One of those sets is North America and all the nodes within it. Content classified under nodes within a single set may have relevance because they are related by something inherent in the structure (they&#8217;re all part of the same ancestor).</p>
<p><i>If you hope to convey, in a single &#8220;view&#8221; or page, contact information for all of your offices in North America, then you want contact information classified under North America and all its descendent nodes (i.e., within the North America nested set). If the user wants to see a summary list of contact information within a particular region, they navigate to a node of greater specificity (a descendent).</i></li>
<li><b>Peers</b><br />
Peer nodes exist within a nested set of the hierarchy. Peers have equal depth within a particular nested set of the hierarchy.</p>
<p><i>Borrowing from the example hierarchy, below, if a user has navigated to content classified under Educational Toys (at depth 2 of the Toys nested set), you may find it valuable to provide them visibility into Plush Toys. These topics share context (Toys), but by providing visibility into peer nodes, you convey the breadth of your toy inventory.</i></li>
</ul>
<p><!-- jen inserted slash-ul here --></p>
<p><center><a href="http://www.boxesandarrows.com/files/banda/developing_and_creatively_leveraging_hierarchical_metadata_and_taxonomy/ricci_06.gif"><img src="http://www-boxesandarrows-com.zippykid.netdna-cdn.com/files/banda/developing_and_creatively_leveraging_hierarchical_metadata_and_taxonomy/ricci_06-thumb.gif" width="400" height="230" border="0" /></a><br />
Figure 6: Set and depth in a product taxonomy (Click to enlarge)</center></p>
<p>So what&#8217;s the point? If you&#8217;re building algorithms that map content, from your database or content management system to a delivery framework (website or application), it may be both necessary and beneficial to consider content by these relationships. </p>
<p>For example, let&#8217;s say the assets in your content management application are organized by type (press release, white paper, image, etc.) and corporate division; it&#8217;s an easy and logical choice that allows you to manage security in the content management application at the department level, but it doesn&#8217;t accurately reflect the organization of your internet site.</p>
<pullquote>
<b>Additional Guidance</b></p>
<ul>
<li>Avoid burdening your content contributors with the maintenance of unnecessary metadata; define only as much metadata as you require for the effective presentation of your assets.</li>
<li>Whenever possible, leverage the information you already have. Identify, and aim to leverage, implicit relationships in the content classification criteria you already collect. </li>
<li>If you’re building your own content classification software, make it extensible – it shouldn’t be necessary for you to revisit the application whenever classification needs vary.</li>
</ul>
</pullquote>
<p>It&#8217;s your task to map content from the source (content management application) to the presentation system (internet delivery framework).  If several parts of your organization are capable of producing press releases, and these press releases are stored in department-specific areas of the content management application, how do you get them on the site?</p>
<p>Example &#8220;algorithms&#8221; for populating the press release and investor news sections of your internet site may read like this:  &#8220;All press release type assets, in a &#8220;ready&#8221; state (leveraging metadata), except those in the investor relations nested set are queried, sorted in descending order by publish date and rendered in the Press Releases section.&#8221;  Additionally, &#8220;All press release type assets, in a &#8220;ready&#8221; state, in the investor relations nested set are queried, sorted in descending order by publish date and rendered under Investor Relations > News.&#8221; </p>
<p>By leveraging the more complex relationships available in hierarchies, we facilitate the presentation (contextualization) of content without impacting it&#8217;s organization.  Too frequently, contextualization algorithms neglect implicitly derived information available in hierarchies. Narrowly scoped queries are used to relate content to its place in the delivery framework. </p>
<p>It&#8217;s up to the information architect to find and realize the value of these relationships when developing a content delivery framework. The opportunity for improved usability and greater content visibility using these relationships is tremendous and oftentimes supports the mission of an IA.</p>
<p><span class="subhead">Technical issues</span><br />
Unfortunately, knowing about these relationships is not enough. Pre-assembled content management and delivery frameworks may not provide the APIs or schema necessary to effectively leverage this knowledge. It&#8217;s important to understand what you want, and then to find and understand the limitations of your tools.</p>
<p>Maintaining and consuming hierarchical data in tabular relational database management systems (RDBMS) systems can be difficult, but with creative database administration, the relationships can be assembled in temporary tables and/or discovered by stored procedures.</p>
<p>This is valuable stuff. If it improves content delivery, consider the cost of developing a method for accessing these relationships if it doesn&#8217;t already exist.</p>
<p><span class="subhead">Summary</span><br />
When developing your content classification strategies (for both content management and presentation), make a point of evaluating your hierarchical structures for relationships and valuable data hidden within them. Relationships between nodes of a hierarchy are complex, but often mirror the perception of your users.</p>
<p>By leveraging these relationships, you can broaden the scope of your queries. Broaden the scope of data presented to the users, and drive users toward tangential content.
<p><img src="http://www-boxesandarrows-com.zippykid.netdna-cdn.com/files/banda/art_end.gif" alt="" title="" width="8" height="8" /></p>
<p><morebox><br />
<b>For More Information:</b></p>
<ul>
<li>Boiko, Bob. <a href="http://www.amazon.com/exec/obidos/tg/detail/-/076454862X/qid=1085374148/sr=8-1/ref=pd_ka_1/104-0384356-7452721?v=glance&#038;s=books&#038;n=507846">Content Management Bible</a>. John Wiley &#038; Sons, 2001.</li>
<li>Aifia Tools <a href="http://aifia.org/tools/">http://aifia.org/tools/</a></li>
<li>Series on controlled vocabularies and faceted classification by <a href="http://www.boxesandarrows.com/people/archives/karl_fast_fred_leise_and_mike_steckel.php">Karl Fast, Fred Leise and Mike Steckel</a></li>
<ol>
<li><a href="http://www.boxesandarrows.com/archives/all_about_facets_controlled_vocabularies.php">All About Facets &#038; Controlled Vocabularies</a></li>
<li><a href="http://www.boxesandarrows.com/archives/what_is_a_controlled_vocabulary.php">What is a Controlled Vocabulary?</a>
<li><a href="http://www.boxesandarrows.com/archives/creating_a_controlled_vocabulary.php">Creating a Controlled Vocabulary</a></p>
<li><a href="http://www.boxesandarrows.com/archives/synonym_rings_and_authority_files.php">Synonym Rings and Authority Files</a></li>
<li><a href="http://www.boxesandarrows.com/archives/controlled_vocabularies_a_glossothesaurus.php">Controlled Vocabularies: A Glosso-Thesaurus</a></ol>
</ul>
<p></morebox></p>
<p><biobox><br />
<a href="http://www.boxesandarrows.com//people/archives/christian_ricci.php">Christian Ricci</a> is a consultant, application developer, web designer, and project manager with over 11 years of experience in software design and development, network and server administration, and software project management and engineering. As a Senior Solutions Architect for Saillant Consulting Group, Chris has lead portal, content, and document management projects for Qualcomm, Intermountain Health Care, J.D. Edwards, EAS, and the Denver Post.</p>
<p>Site: <a href="http://www.chiamonkey.com/">http://www.chiamonkey.com/</a><br />
Resume: <a href="http://www.chiamonkey.com/mealticket">http://www.chiamonkey.com/mealticket</a></biobox></p>
]]></content:encoded>
			<wfw:commentRss>http://boxesandarrows.com/developing-and-creatively-leveraging-hierarchical-metadata-and-taxonomy/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>
