Findability Archives - Boxes and Arrows

Forget the Trail of Breadcrumbs

Diana Sonis — Mon, 05 Jul 2021 14:05:32 +0000

Enterprises often have a simplistic understanding of navigational structures in UX Design. Companies shy away from messing with known organizational schemas for fear that their users or customers will become confused and run away. We don’t give our users enough credit.

As a result, most software navigational structures either reflect hierarchical departmental company/brand organization (because how can users be confused by that?), or a very top-heavy list of bucketed themes loosely based on general product “themes” (hello Amazon!).

Besides thinking about the actual organizational structure, we also know that a user must have some guideposts to retrace their steps if lost. Thus, today, “leaving a trail of breadcrumbs” really means allowing the user to get back to where they came from in a linear path by backtracking their steps. How cumbersome it is to hierarchically retrace steps or click the browser back button like a well-trained monkey.

Taking all this into account, we got to thinking: What if a navigational structure could shapeshift to fit a user’s intention at any moment at any place? What if a user could apparate to any point of the website/product from their current location based on their intention/need? Isn’t that what hyperlinking is all about anyway?

To do this, a proposed navigational structure would have to (1) understand what the user intends to do at all times and (2) be flexible enough to transport the user to any place in the software based on the current user intention.

Let’s examine these two requirements separately.

1. Understanding what the user intends to do

Understanding user intentions requires fulfilling the pervasive user expectation that digital properties should behave in a more “human” way, where websites, products and services we design have an “organic conversation” with the user.

An organic conversation is a bridge of understanding between the customer and brand within the digital medium where they meet; a conversation that flows naturally just as between two friends.

This organic conversation can be created by designing intent-based navigation through a process we call Designing for Customer Intentions. To do this, we must understand, document, and design from the viewpoint of what customers intend to do when interacting with a digital medium, rather than cataloging and rearranging the content businesses currently have to be more “user-friendly.”

An example of this can be seen on Patagonia’s individual product pages. After all the expected product information, there are doorways into learning how and where the item is made. Given the well-documented eco-consciousness of Patagonia customers, it’s likely that one of their intentions is: when I purchase a product, I want to ensure/learn about how it’s made. Patagonia was started on the premises of, and intentionally appeals to, customers’ minimalist or sustainable ethos.

Patagonia product page

To satisfy their user’s intent to discover the impact of available goods for purchase, the navigational structure of the page includes an “Impact” section, detailing fair trade, materials, and other informational elements their customer may care about. This reinforces the customer’s perception that the business “understands” them. This, in turn, leads to increased customer feelings of autonomy and competence regarding their purchase, and therefore to higher purchase rates and increased loyalty.

2. Apparating the user

Imagine wandering around a well-designed house. Rarely is there a need to go back to the front door to find the pantry or the bathroom, because there are well-designed ways to take you to the right room at the right time. Similarly, there is no reason to force the user to retrace their steps in a digital product to move forward.

If we think about navigation as an organic conversation between the digital property and the user, navigation takes on new meaning. It’s no longer just a well-organized hierarchy of content, but a lateral, intentions-based structure that shifts to accommodate user intentions in the moment.

You’ll see examples of this with text-heavy content sites like Wikipedia where you have contextual links to further research related content. Likewise, in the product space, you can observe this lateral, intention-based navigational structure in good CRMs (customer relationship management systems) that allow the user to access all needed functions without having to go back to the main dashboard.

Why Navigation Needs to Evolve

Creating an intentions-based navigation is required for our software navigational structures to evolve with our users’ expectations.

Users expect digital products to understand them as other humans do and we must innovate to fulfill, and eventually exceed, this expectation.

Let’s imagine, for a moment, that there are generally two types of personas that describe 70-80% of all users of a digital property or website: (1) people with a singular goal who know what they want, and (2) explorers who meander and delight in discovery.

Our solution so far has been to rely on breadcrumbs, but no more! Whether exploring or focused, users expect technology to “understand” what they want, which really means people expect the business to magically guess their intentions and help them apparate (or transport) to wherever they need to go to fulfill those intentions.

Designing intentions-based navigation structures helps the customer and the business. Businesses are better served when users dive deeper into their online ecosystems through continuously fulfilled intentions. Ever take 3 minutes to watch a YouTube video, only to come up 45 minutes later from the rabbit hole of those similar videos offered up on the right side? You have intentions-based navigation to thank.

Intentional navigational design serves the business because users enjoy autonomy and competence in using the site or service. They become loyal users, intrinsically motivated to return to this business again and again. Tracking user navigational patterns further allows the business to learn more about the user, which in turn allows them to provide better service.

Defining User Intentions Is Key

Designing for customer intentions decreases the amount of information that the customer needs to consume, decreases their cognitive load, increases their success rate of fulfilling their goals, and improves the overall positive perception of their experience with a business. Over time this creates happy, loyal customers.

We will explore how to design for user intentions in Part II. The key takeaway here is:

By serving up the right content at the right moment to meet, or exceed, users’ goals and expectations, UX experts serve both the user and the business in a more thoughtful, profitable way.

UX designers create impactful spaces that sit comfortably at the intersection of business, technology, and design by creating flexible, adaptable navigational structures around well-defined user intentions. This helps the designer create and retain loyal customers. A win-win.

Photo by Ashley Batz on Unsplash

Evolve or Die

The Designing for Customer Intentions method is not the only way to evolve the design of digital product navigation. For example, there are plenty of articles written about “jobs to be done” and other methods. We encourage you to research these methods and develop your own viewpoint on how to evolve your practice. The main idea is to evolve, or risk losing your users to other, more intentional, competitors.

So, dear reader, how will you choose to humanize navigation to align with what users intend to do in your digital space? The choice is yours.

Featured photo by Susan Q Yin on Unsplash.

The post Forget the Trail of Breadcrumbs appeared first on Boxes and Arrows.

Taxonomies: Connecting Users to Content

Heather Hedden — Tue, 06 Apr 2021 02:00:00 +0000

Taxonomies may be thought of as hierarchies of categories to group and organize information to be found when browsing, or as a structured set of terms used to tag content so that it can be retrieved efficiently and accurately. Sometimes the same taxonomy may serve both purposes, and sometimes two different taxonomies are used, one for each purpose, for the same content or site.

Taxonomies are not new. In fact, a lot has been written about them, including an informative series of six articles here in Boxes and Arrows by Grace Lau in 2015. An area that still needs to be better understood is exactly how taxonomies should be designed and implemented to be most effective.

Suiting Users’ Needs

The previous series of articles on taxonomy by Lau addresses many important points about taxonomies including building the business case for a taxonomy, planning a taxonomy, and taxonomy governance. In the first article of the series, “Planning a Taxonomy Project,” she states: “Understanding the users and their tasks and needs is a foundation for all things UX. Taxonomy building is not any different. …Who are the users? What are they trying to do? How do they currently tackle this problem? What works and what doesn’t? Watch, observe, and listen to their experience.”

In this article, I will explain the role of a taxonomy as a tool that connects users to content.

The taxonomy sits between users and content, and information flows in both directions.

Understanding the users is of central importance, so let’s consider specifically two techniques we can use to make a taxonomy more suitable for its users: (1) adapting the names or labels of the taxonomy concepts (terms) to the language of the users, and (2) adapting the categorization hierarchy to the expectation of the users. The complexity is to do this for multiple different users with the same taxonomy for the same content.

Different Options for Concept Labels

Different users will call the same thing by different names, whether they are simple synonyms, such as Doctors vs. Physicians or Cars vs. Automobiles, or words or phrases that are not exact synonyms but close enough, such as Computer security, Cybersecurity, Information security, and IT security.

Different labels for the same concept: Computer security, Information security, Cybersecurity, and IT security

Taxonomies, in contrast to mere navigation labels, make use of such “alternative labels” for each concept, also known as non-preferred terms in thesauri. These are colloquially referred to as synonyms, but they are not exactly synonyms; they are labels for concepts that are sufficiently equivalent for the context of the content and the taxonomy. Thus, users searching on any of the various alternative labels will retrieve the same concept and its associated content.
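As a rough illustration of how alternative labels can resolve to a single concept, here is a minimal Python sketch. The `Concept` class, the `build_label_index` and `resolve` helpers, and the concept ID are hypothetical names made up for this example, not the API of any taxonomy management tool. The labels are the ones from the example above, and the choice of preferred label is an assumption.

```python
from dataclasses import dataclass, field


@dataclass
class Concept:
    """One taxonomy concept with a preferred label and alternative labels."""
    concept_id: str
    preferred_label: str
    alt_labels: list[str] = field(default_factory=list)


def build_label_index(concepts):
    """Map every label (preferred or alternative), lowercased, to its concept."""
    index = {}
    for concept in concepts:
        for label in [concept.preferred_label, *concept.alt_labels]:
            index[label.lower()] = concept
    return index


def resolve(index, query):
    """Return the concept a user's search term points to, or None."""
    return index.get(query.strip().lower())


# Hypothetical data: which label is "preferred" is an assumption.
security = Concept(
    concept_id="c-001",
    preferred_label="Computer security",
    alt_labels=["Cybersecurity", "Information security", "IT security"],
)
index = build_label_index([security])

# Every variant leads to the same concept, and so to the same tagged content.
for term in ["cybersecurity", "IT Security", "Computer security"]:
    concept = resolve(index, term)
    print(term, "->", concept.preferred_label)
```

Matching here is case-insensitive and exact. A real search front end would probably also handle stemming and misspellings, which this sketch leaves out.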

It is a design choice whether the alternative labels are displayed before redirecting to the concept with the preferred label, or if the redirect is without a display and the user is taken directly to the tagged content set. Displaying alternative labels is educational for repeat visitors, whereas no display of alternative labels to end-users provides a clean, quick user experience. Users may not be aware that their chosen name was actually “alternative” and not “preferred.”

When a taxonomy is displayed for hierarchical browsing, only the preferred labels for each concept can be displayed. Designation of a preferred synonym as the label should reflect the wording preference of the majority of users.

If there are two distinct sets of users, such as employees and customers, where a number of preferred labels vary, it is possible to create two display versions of the taxonomy. This can be a little more complicated to implement because commercial taxonomy management software typically supports just one preferred (i.e. display) label per concept by default. You may need to create two separate taxonomies and link them at equivalent concepts.

Different Options for Categorization

Different users may categorize differently and will look for the same thing in different places. Lau’s articles gave the example of different users of a kitchen wanting to group different ingredients differently. This would certainly be a challenge in sharing the same physical space.

Fortunately, taxonomies are used to describe digital space so there is flexibility. While a physical object can exist in only one place in a kitchen, a library shelf, or a store shelf, the same taxonomy concept representing an idea may exist in more than one place in a taxonomy hierarchy.

In another example, some people might categorize Financing agreements under Financial documents and some might put the category under Contracts. Thus, we can have the taxonomy concept Financing agreements appear both as a narrower concept of Financial documents and as a narrower concept of Contracts, and all the same tagged documents will be found in both locations. This is what taxonomists call “polyhierarchy.”

Taxonomy excerpt of the concept Financing agreements in a polyhierarchy, appearing under both Contracts and Financial documents.
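The polyhierarchy described here can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `broader` and `tagged` dictionaries, the `documents_under` helper, and the file names are all invented for the example.

```python
from collections import defaultdict

# Each concept lists its broader (parent) concepts. Having more than one
# parent is what makes this a polyhierarchy.
broader = {
    "Financing agreements": ["Financial documents", "Contracts"],
    "Financial documents": [],
    "Contracts": [],
}

# Hypothetical tagged documents; the file names are invented.
tagged = {
    "Financing agreements": ["loan-2020.pdf", "lease-2021.pdf"],
}

# Invert the broader links to get each concept's narrower (child) concepts.
narrower = defaultdict(list)
for concept, parents in broader.items():
    for parent in parents:
        narrower[parent].append(concept)


def documents_under(concept):
    """Collect documents tagged with a concept or any of its narrower concepts."""
    docs = set(tagged.get(concept, []))
    for child in narrower.get(concept, []):
        docs |= documents_under(child)
    return docs


# Browsing from either parent reaches the same tagged documents.
print(sorted(documents_under("Contracts")))
print(sorted(documents_under("Financial documents")))
```

The documents are tagged once, to the concept itself. The two browsing paths are just two routes to that one concept, so nothing is duplicated.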

One thing to keep in mind is that polyhierarchy is appropriate for hierarchical taxonomies, not for faceted taxonomies of attributes or filters (such as ecommerce facets of Size, Color, Material, and Style), where the same concept should exist in only one facet.
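A quick way to check the one-facet rule is to look for concepts that show up under more than one facet. The sketch below assumes a simple dictionary of facets. The facet values, including the deliberately duplicated "Natural", are made up for illustration.

```python
# Hypothetical ecommerce facets; the values are invented for illustration.
facets = {
    "Size": ["Small", "Medium", "Large"],
    "Color": ["Red", "Blue", "Natural"],
    "Material": ["Cotton", "Wool", "Natural"],
}

# Record which facets each concept appears in.
seen = {}
for facet, concepts in facets.items():
    for concept in concepts:
        seen.setdefault(concept, []).append(facet)

# Any concept listed under more than one facet needs a taxonomist's attention.
duplicates = {c: f for c, f in seen.items() if len(f) > 1}
print(duplicates)
```

Whether a flagged concept should be renamed, split, or removed from one facet is a judgment call that the script cannot make.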

Methods of Obtaining User Input

The methods to develop a taxonomy involving users have some similarities and some differences compared to other UX methods. Card sorting can be used to gather user input for taxonomies, but it is effective only for 2-3 levels of a hierarchical taxonomy and is not as effective for designing facets, where the challenge is to identify ways to describe, not ways to categorize. Some hierarchical taxonomies have many more levels, so card sorting is most practical for just the top levels, or else it would become too time-consuming for the multiple hierarchies at each level. Taxonomies are more extensive than just the navigation structure of a website.

Users of a taxonomy include both those who are looking for information and those who would be using the taxonomy to tag content. Representatives of these two different user groups should be interviewed with different questions. For example, those who need to retrieve content may be asked questions around the challenges in finding content and search terms; those who tag content may be asked questions about challenges in finding appropriate terms for tagging. Similarly, user testing of the draft taxonomy should also involve both uses of tagging and uses of retrieval.

Content management users, especially those dealing with particular subject domains, may be asked to submit lists of suggested terms that fall into deeper levels of the taxonomy. Those submitting suggested terms should be provided with clear guidelines stating that the terms are for tagging content, so that they do not suggest terms that are too specific and not reflected in the actual content. These terms then should be reviewed and discussed with the taxonomist to make sure that they are suitable for the taxonomy.

Another method to gather user input indirectly for a taxonomy is to analyze search logs to identify what words and phrases users have been entering into a search box to find content. These words and phrases should be considered for alternative labels (synonyms) for taxonomy concepts, and possibly for additional concepts in the taxonomy, if warranted by the content.
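A search-log review along these lines might start with something like the following Python sketch. It assumes the queries have already been exported as a list of strings. The `known_labels` set and the sample queries are invented for the example.

```python
from collections import Counter

# Labels already in the taxonomy (preferred and alternative), lowercased.
known_labels = {"computer security", "cybersecurity", "doctors", "physicians"}

# Hypothetical search-log entries; a real log would come from a file or database.
search_log = [
    "cybersecurity",
    "infosec",
    "Infosec",
    "physicians",
    "gp",
    "infosec ",
]

# Normalize each query, then count only those that match no existing label.
normalized = (query.strip().lower() for query in search_log)
unmatched = Counter(q for q in normalized if q and q not in known_labels)

# Frequent unmatched queries are candidates for new alternative labels.
for query, count in unmatched.most_common():
    print(f"{query}: {count}")
```

The counts only surface candidates. Deciding whether "infosec" becomes an alternative label for an existing concept or a new concept of its own still depends on the content.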

Conclusions

While UX research is a formal job role, taxonomy research is not, although there are standard practices. Taxonomy research is rolled into the overall taxonomy design and creation job. Because taxonomies are based on the content they are tagged to, taxonomy creators may fall into the trap of exclusively focusing on making the taxonomy reflect the content without also considering the need of making the taxonomy suitable for its users. Taxonomy user research may not be as formal or extensive as other UX research, but it is critical to the success of a taxonomy.

Heather Hedden will be conducting a workshop on this topic at the 2021 Information Architecture Conference. Participants will learn taxonomy creation principles and how to address the issues of designing a taxonomy to serve users.

Featured photo by KOBU Agency on Unsplash.

Additional images by Linda Ramirez and Heather Hedden.

The post Taxonomies: Connecting Users to Content appeared first on Boxes and Arrows.

Stop Counting Clicks

Robert Goesch — Mon, 12 Oct 2020 18:00:00 +0000

Every user interaction is a decision. Every decision can lead to an exit. So the more options we offer, the more exit opportunities we create, which will reduce the probability of conversion. Right? Well…

In fact, the number of interactions a user makes is in no way directly related to conversion rates. It might be a surprise, but there is no statistical evidence to prove that this widely held belief is true. The number of clicks appropriate for a task depends solely on the requirements regarding complexity, security, and usability. In this article, we’re going to share with you how we use these requirements to assess how many clicks are appropriate on a page. Once we started looking at clicks through this lens, we were able to increase conversion, reduce task time, and increase customer satisfaction.

The 3-click rule is dead

The “3-Click Rule” has been causing a ruckus for decades. In 2001, Jeffrey Zeldman suggested in his book »Taking Your Talent to the Web« that all information should be available on a website within three clicks. If you take a look at the state that web design was in back then, this isn’t a big surprise. It seemed like the more information that was on the page, the better. At that time of course, the data on interactions with digital services was quite scarce.

And yet, a full 13 years after Zeldman, we stumble onto this rule again. This time it’s Marissa Mayer, then CEO of Yahoo, before that Vice President at Google. She said in 2014, “Once you’re in the app, is it two taps to do anything you want to do? If not, time to redesign the app”. Back then, she was responsible for the redesign of the Flickr app, and it was considered a benchmark by many. This was obviously leaps and bounds better than the apps of 2001, but again, the quality of the product was attributed to the number of clicks the user needs to reach their goal. And still today, in 2020, I hear on the phone: “But that’s an extra click”. What’s fascinating is that if we take the time to look at the data, it shows us that the number of clicks has absolutely nothing to do with the success of your product.

Numbers show: People click. And it doesn’t matter how much.

As early as 2003, a study by Joshua Porter found that the number of clicks is in no way related to the success of users finding the content they are looking for. It’s also not related to the user’s satisfaction with our product. Let me say that again: A longer process (more clicks) for the user doesn’t result in a higher bounce rate or in elevated dissatisfaction. And what’s really interesting, we’ve found similar results with our own projects.

Here at DUMBO, we designed a test that examined the bounce rate of 150,000 sessions. Within their session, each user had to answer a series of questions in six steps. This meant making at least six clicks to reach the desired result. Once the first step had been completed and the process started, the bounce rate remained constant at just one percent for each subsequent step. That was a true eye-opener for us. The remarkable thing about the figures was not only that the bounce rate did not increase with each click, but that the bounce rate itself was so low.
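To put the one percent figure in perspective, here is a small worked calculation in Python. It rests on an assumption that the article does not spell out: that the one percent bounce applies at each of the five steps after the first, to the users who reached that step.

```python
# Assumption: a constant 1% bounce at each of the five steps after the first,
# applied to the users who reached that step.
bounce_per_step = 0.01
subsequent_steps = 5  # steps 2 through 6 of the six-step flow

# Retention compounds: each step keeps 99% of whoever arrived at it.
retention = (1 - bounce_per_step) ** subsequent_steps
print(f"Share of started sessions that finish: {retention:.1%}")
```

Under that reading, roughly 95 percent of users who complete the first step make it through all six.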

When it comes to designing a checkout process, different approaches have been in hot competition for years. You’ll see everything from one-page checkouts and accordions to processes with several steps. We recently completed a redesign of the booking process for a major airline, and are happy to report that we achieved an increased conversion rate by increasing the number of steps in the process. The number of first-time bookings rose by ten percent. The data also showed that the time it took for the user to complete their booking was significantly reduced. Although customers had to go through nine steps instead of the previous six, the average booking time fell from 6:48 minutes to 3:48 minutes: more clicks, but nearly twice as fast.
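The booking times quoted above can be checked with a few lines of Python. The `to_seconds` helper is just a convenience for this example. The per-step averages assume time is spread evenly across steps, which the article does not claim.

```python
def to_seconds(minutes, seconds):
    """Convert a minutes:seconds duration to seconds."""
    return minutes * 60 + seconds


before = to_seconds(6, 48)  # 408 seconds across six steps
after = to_seconds(3, 48)   # 228 seconds across nine steps

speedup = before / after
print(f"Speedup: {speedup:.2f}x")
print(f"Seconds per step before: {before / 6:.1f}")
print(f"Seconds per step after: {after / 9:.1f}")
```

The total time drops by a factor of about 1.8, while the average time per step falls from 68 seconds to roughly 25.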

One more click, for less complexity

Every time we test processes that have the same content presented in a different number of steps (and clicks), the results are the same. In a qualitative survey, most users stated that they found the process with more steps longer, but much easier to use. However, interestingly, the data usually proves that the same users actually completed the multi-step variant faster than the one with fewer steps. Additional steps simplify and speed up complex processes.

This seems counterintuitive, but it actually makes a lot of sense when you think about it. By giving the user more clicks, you’re reducing the chances that they will feel overwhelmed or stressed. When stressed, people tend to react defensively. We all know this feeling from filing our tax returns. We put it off until we have no other choice because it’s complicated and time-consuming. Apps like TaxFix are successful because they break down the complex form into understandable questions that can be answered in seconds.

The more complex a process, the more important it is to break it down into simple steps. But how many steps do we need? And what’s too big or too small for one step? Our own rule of thumb is: all information must serve a purpose without the possibility to break it down further. Let’s explain the theory by taking a look at an example from a checkout process. Which step makes it unmistakably clear what has to be done, without the possibility to break the step down even further?

1. Fill out the form fields to complete your order.
2. Enter the payment details to order the product.
3. Choose a payment method you want to use to pay for the product.
4. Select the bank of your credit card.

The correct answer here is 3. The request describes one simple task in order to fulfil a real purpose. You can expect to be prompted to choose credit card, PayPal, invoice, or another method. Options 1 and 2 are less clear. They are too large and ambiguous. It would be difficult to anticipate the details that would be required. Option 4 is too specific; there wouldn’t be a clear purpose that this step would fulfill. It wouldn’t help us get closer to our goal.

So how do you figure out if you need to reduce the complexity of a task by adding more steps/clicks? By weighing up the pros and cons: Is the perceived effort (a longer task) or is the actual cognitive effort the greater hurdle for our application? In general we can say: the higher the cognitive effort, the more clicks we should dedicate to it.

One more click, for more security

The higher the risk, the greater the need for control. For example, whenever we as users make an investment that is larger than we are used to, we look for security. And we feel secure when we feel we are in control of the situation. With more steps, we are able to anticipate the effect our actions have. We can also reinforce this feeling with UX design through a high communication density. Chapter sheets, milestones, and summaries reduce the speed but increase the feeling of control. More clicks for more security.

Is it always the case that the higher the security risks, the more we must increase the communication density? If only it were that simple. A high communication density also means more friction. In some cases, we want to reduce any and all friction. For instance, when we recently developed a crypto wallet, we had professional traders needing to complete transactions involving millions of dollars, and fast. Every second counts with their order. Every bit of friction, every extra step can cost money, a lot of money. So we had to make it possible without any control mechanism in this situation. What’s the difference? These users are very familiar with this process, they know what they are doing and they do it all day long. The need for control mechanisms therefore decreases the more experienced users are in the situation. Depending on the context and target group, well-intentioned measures may even be perceived as an obstacle.

Most interfaces developed in product design tend to be aimed at private users. Often it’s users who aren’t familiar with the system or who will only go through it once. And every single click can create a feeling of control. It’s important to consider how fast the system can go without losing the user. Does our application need more speed or more control? The more control, the more clicks.

One more click, for better usability

If you think about it, many interaction patterns were developed from when we used to use a mouse and keyboard to navigate our large desktop monitors. This of course was a major advancement over the conventional remote control we used for our TVs (and still do for our Smart TVs today). So when smartphones were introduced, so were gestures (swiping, tapping, etc.) to navigate around on the small, handheld screens.

Anyone who has to fill out the so-called “One Page Checkout” on mobile devices knows how nerve-racking this undertaking can be. Once we activate an input field, the keyboard covers one third of the screen. We then have to use our two thumbs for each input. Auto correct tries to help us but ends up driving us insane, and while we scroll to see what’s left to fill out, we lose sight of the beginning of the form. It’s not fun.

Mobile First design forces us to rethink. We counter the restrictions of the device and the context of the interaction with maximum reduction. We know that the space on the screen is very limited, the time window of attention decreases, and the risk of distraction increases. It is therefore important to keep the obstacles as low as possible, to reduce the cognitive effort to an absolute minimum and to make the interaction as effective as possible.

All these aspects force us to design tasks sequentially with more clicks and thus to reduce the number of inputs per screen. Usability is related to cognitive effort and the need for control. If an application is to become simple and secure, device-specific optimization becomes important. Here, too, the context determines whether an increased number of clicks is beneficial to usability. As a rule of thumb, the more limited the end device is, the more clicks the process requires.

Conclusion

The moral of the story? Let’s stop counting clicks. The speed, conversion rates, and user satisfaction for your product are in no way connected to the number of clicks a user makes. And once we start limiting clicks, our page quickly starts to look like a directory: a list of every option, tiny font, in alphabetical order. For the user, this ultimately ends up feeling like we’re looking for a needle in a haystack. This is not the experience we’re hoping to achieve.

Instead, we should focus on the human. We should zero in on how they want to use our application. The less experience they have, the higher the risk and the stranger the situation, the more stress the task entails.

Here are a few reminders moving forward:

Reduce complexity to keep the cognitive effort low.

Apply control mechanisms to create a feeling of security.

Optimize for device input limitations to improve usability.

Use complexity, security, and usability as metrics to provide clarity around your product development requirements. This way, you’ll be able to create the right experience based on your specific situation, regardless of the number of clicks.

Photo by Crissy Jarvis on Unsplash

The post Stop Counting Clicks appeared first on Boxes and Arrows.

Keep the Kitchen Cabinets from Overflowing

Grace G Lau — Tue, 26 Sep 2017 08:00:43 +0000

Don’t laugh. I’m sure you’ve done this before. At the office, there’s a refrigerator cleanup every two weeks. At least I think it happens every two weeks. The office administrator sends out an email or posts a note on the fridge, warning you that things will be dumped if they’re not labeled. You’ve seen these long-forgotten food containers of who-knows-when science experiments pushed up against the back of the fridge. Same with those things that start growing in your pantry…. Don’t ask. I won’t continue. Please don’t tell my mother I had so many potatoes left.

When it comes to explaining governance, the one in the kitchen is the best example to illustrate exactly what happens when you take a taxonomy for granted. Not only do you see it, you smell it. You’ll feel it if you consume foods way past their best-by or expiration dates. You’ll taste the food quality deteriorate if the ingredients used are not as fresh as they could be. What better way to illustrate ROT analysis than the five senses? This kitchen analogy doesn’t stop at organization.

Previous articles in this kitchen taxonomy series went through outlining the business case for building a taxonomy, card-sorting to generate labels, and tree-testing to assess findability. At this point, it’s an overhead project at most companies.

However, this is an important reminder: Once you’ve developed and applied that taxonomy to your content, the project is far from done. Establishing taxonomy governance is a crucial endeavor, one that makes sure that your content or application continues to be well maintained and to perform as well as it did on the day it launched.

We’ve asked these questions before in part 2, Planning a Taxonomy Project. Why and what will the taxonomy be used for? Who is using it? How will it be built? How will it be maintained? How will we ensure that it is properly maintained? And—of course—who will do all of this?

In this part 6, we’ll revisit those questions and think about how to account for taxonomy management and quality control. Remember, taxonomy governance maximizes the ROI of the taxonomy project and prevents the moldy science experiments in the pantry in the first place.

To document these discussions and make them official, you should consider drafting a document like a charter to keep these decisions in line. In this charter, include the following sections:

Purpose. What value does this taxonomy bring?
People. Who makes the decisions and who manages the taxonomy?
Process. How often and how does the taxonomy get updated?

Our kitchen taxonomy came about because we have many cooks in my household. We need to be able to:

Know the name of the ingredient in English and Chinese. Specifically, we needed to know the regional differences in American English, Northern Chinese (Putonghua), Hong Kong Chinese, and even Chinglish!
Know how to find unusual ingredients. Referring to the 80-20 rule, this is the 20% of things that we don’t use often.
Know how ingredients are organized. This considers streamlining workflows so that we can find things easily when we need them.

Over time, this project has evolved. Talk about scope creep! From the initial physical classification of the items in my pantry, it has evolved to organize the printed recipes in my recipe binder and the digital copies of the recipes saved in a note-taking app.

The taxonomy scope itself has remained the same: spices and food items. Along the way, I learned that storage requirements should not be a limiting factor. There are cooking ingredients that one keeps refrigerated. Not all spices are kept at the same temperature. So now sauces, cooking oils, and other condiments used either before, during, or after cooking are also included in this scope.

Purpose: What value is your taxonomy bringing you?

Stating the value of a taxonomy in an elevator pitch is important to get everyone on the same page.

Here are some ways to consider the value of a taxonomy¹:

Search. How would a taxonomy make search better?
Navigation. How would a taxonomy support site navigation?
Standardization. How would a taxonomy standardize terminology being used to categorize content? How would a taxonomy help create a common language?
Discovery. How would a taxonomy help users discover new terms or relationships?

A statement of purpose for my kitchen taxonomy could be:

We are creating a taxonomy to enhance findability for cooking ingredients at Grace’s house.

Or:

We are creating a taxonomy to standardize the terminology used to describe the same ingredients across regional differences.

Findability and standardization here are two different goals. Determine which goal has the higher importance for its success. At the same time, don’t forget to follow through on achieving secondary goals. Break the assumption that the interim solution is the permanent solution!

After a few false starts, I decided that findability is the primary purpose of this taxonomy.

A taxonomy usually has three phases of development, notes Mike Doane²:

Set up. What you should do short term to get the taxonomy ready for use
Launch. What you should do to get the taxonomy up and running
Maintenance. What you should do to keep the taxonomy relevant and useful

Once you’ve determined the primary goal, you can consider the taxonomy’s secondary goals by prioritizing users of your taxonomy and addressing their needs in a structured manner. Although most taxonomy projects tend to end at the setup and launch phases of development, you should do your due diligence and keep the taxonomy relevant and valuable through maintenance. Testing the taxonomy with each release helps validate and confirm users’ expectations and search behavior. For more information about taxonomy validation, check out Alberta Soranzo and Dave Cooksey’s work.³

People: Who should manage the taxonomy?

When thinking about taxonomy stakeholders, consider the RACI matrix.

Who should be responsible (for the work)?
Who needs to be accountable (for ownership)?
Who needs to be consulted (provide input)?
Who needs to be informed (told after the fact)?

As people who live in the house all year long, the husband and I are responsible for the daily maintenance of the kitchen. We also participate in the daily upkeep of the pantry and kitchen activities. We do the shopping and the cooking.
We usually decide which type of soy sauce or fish sauce is purchased. Personally, I tend to pick up the brand my parents use. My in-laws don’t seem to prefer a certain brand of soy sauce over another since the brands are not what they are used to in China, and they definitely don’t use fish sauce in their dishes … but they determine when it is time to buy another bag of rice. My father-in-law doesn’t go a day without rice.

As frequent users of my kitchen, my parents (who visit every once in a while) and my in-laws (who live with us half the year) are consulted in the taxonomy. They aren’t expected to make taxonomy updates; they are consulted as subject-matter experts.

If my mother-in-law has a preference for a certain brand of rice, we’ll take it into consideration, test it, and determine whether it’s worth a long-term investment. A 25-lb bag of rice won’t last very long while they’re in residence, but we will need to decide whether to continue purchasing that brand when the in-laws return to China. It would not be a good investment to waste a bag of rice to feed the rice weevils.

For a small kitchen taxonomy, you wouldn’t need a full-time taxonomist, a team, or a committee to manage the taxonomy. However, you should consider the following for an enterprise taxonomy:

Editor/Taxonomist. Ideally an information architect, taxonomist, or business analyst who is familiar with the content and can manage updates, solicit feedback from end users, and integrate changes.
Team. Ideally 2-3 people trained in information architecture (from the user’s perspective) and search (from a technical perspective).
Committee. A small committee of 3-5 people to meet a couple of times a year to discuss taxonomy changes and approvals.

Process: How often should the taxonomy be updated?

A kitchen taxonomy should be updated as often as necessary. That means it could change as often as every day while putting groceries away or during meal preparation.

In a company, however, this frequency could potentially cause chaos. An enterprise taxonomy should be updated on a regular schedule, according to defined rules set forth by a governance team or committee.

Part of this process is to set policies and procedures so that taxonomy updates are made in a consistent manner. This is important to prevent an arbitrary decision to move all the coffee to a new location.

I’d start with a few guidelines from Heather Hedden’s “Accidental Taxonomist”⁴ (pg. 317) and build from there:

Rules for adding, changing, moving, or deleting terms or relationships such as hierarchical relationships, alternative terms, associative (or semantic) relationships
Examples of types of changes to expect and the processes for handling such changes
Specific guidelines for handling feedback and change schedules

Then, using Mike Doane’s top ten guidelines⁵, you’d be able to build a solid starter document for taxonomy governance. These top ten are important for starting out slowly and simply—keys for successful change management. If an organization doesn’t have a taxonomy in place already, having fully-decked out guidelines at the start is a sign of the taxonomy falling flat on its side.

When I consider my own kitchen taxonomy, the rules for adding terms are pretty straightforward. It occurs whenever the in-laws are here or whenever someone decides to try out a new recipe.

In the past year, I’ve tried and experimented with making a Chinese recipe for 8-Treasure Congee (八寶粥 bābǎozhōu). The ingredients are interchangeable and easy to put together, but it’s eight items. And you know what? There is a version of this with 18 ingredients that’s touted as an extremely healthy breakfast. Eighteen! The pantry is bursting at the edges, just thinking about it.

When it comes to indicating relationships with ingredients, it gets a little complicated. Imagine having to hunt through different places in the pantry for glutinous white rice, red beans, raw peanuts, and barley. These are all used in 8-Treasure Congee, but they are also used in other recipes, including soups, desserts, and rice dishes.

What’s the best way to add these new ingredients? Should they be grouped by their ingredient type? Or should I group them together as a special functional group as I currently do for baking ingredients? What would be the most optimal way to do this, considering workflow? If there is no good answer, is this another case where there should be two homes for an ingredient? Here, it’s time to consult the subject matter experts.

Next, I’ll talk about some best practices for enterprise taxonomies. But right now, I need to schedule another quarterly pantry cleanup session before the in-laws return from China. Somehow, our collection of ramen and spam has grown out of control while they were away…

Footnotes and further reading

1. Doane, Mike. “Taxonomy Governance: Why You Need It, How It’s Done.” CMSWire (May 29, 2012): http://www.cmswire.com/cms/information-management/taxonomy-governance-why-you-need-it-how-its-done-015813.php
2. Doane, Mike. “What to do now: Immediate needs,” in Enterprise Taxonomy Governance: Practical Advice for Building and Maintaining Your Enterprise Taxonomy (Volume 1). CreateSpace Independent Publishing Platform, 2017.
3. Soranzo, Alberta and Dave Cooksey. “Testing Taxonomies.” Association for Information Science and Technology Bulletin (June 2015): https://www.asist.org/publications/bulletin/jun-15/testing-taxonomies/
4. Hedden, Heather. The accidental taxonomist. Medford, N.J.: Information Today, 2010.
5. Doane, Mike. “Process,” in Enterprise Taxonomy Governance: Practical Advice for Building and Maintaining Your Enterprise Taxonomy (Volume 1). CreateSpace Independent Publishing Platform, 2017.


The post Keep the Kitchen Cabinets from Overflowing appeared first on Boxes and Arrows.

Could You Hand Me the Dry Rub Please?

Grace G Lau — Tue, 29 Nov 2016 08:00:04 +0000

Tree testing is an effective technique for evaluating navigation and taxonomy. In an environment devoid of visual design and cues, tree tests are useful for assessing existing site navigation and proposed site structure changes. Using my kitchen, I devised a plan to test the findability of my kitchen’s spices and pantries.

Card sorts versus tree tests

Card sorts are about organizing; they ask where would you put it? For example, in a card sort, a task would be to put things away after a trip to the grocery store, or to clean up after I cook. Where would you store this bag of baking soda? (Aside: Misplaced baking soda is turning out to be a common theme across my posts.)

Tasks in a tree test are instead focused on findability and labels:

Help me find ____.
Where should I look based on these clues?

Also known as navigation testing, tree testing strips visual aids from a web site, condensing it to a collapsible, expandable folder tree. We don’t know the contents of the folder. The tree test asks where would you look for it?

Why tree testing?

In a card sort, you’re testing for the commonality of the cards. In a tree test, you’re testing the labels, not the cards.

In an open card sort, you’re asking the user to help you generate labels for the groups they’re creating. In a closed card sort, the labels are predetermined and all the user is doing is grouping the cards under those labels.

Tree testing helps assess whether those labels make sense. In fact, I tend to use this as a way to benchmark an existing navigation or to test a proposed sitemap. In retrospect, if I’d thought about this six years ago, I should have conducted a tree test every time I moved to see if the kitchen was better organized over time.

Tree testing also helps validate nomenclature with your users and confirm whether you and your users have the same mental model.

Putting the tree test together

So far, I’ve conducted a content inventory and a card sort. My next mission was to take the labels generated from the card sort and apply them to areas in my kitchen. How did I want to label the areas in my kitchen to best reflect function and workflow?

The tree or sitemap

Tree testing focuses on using sitemaps (referred to as “trees” in Treejack, a popular software for this mission) to test navigation quickly. Once you have a sitemap and an idea of your users’ key tasks, you can start planning, executing a test, and iterating with revisions. There’s no dependency on the visual designer to make changes in Photoshop or on the interaction designer to create the prototype adjustments. This is all just making sitemap changes in a spreadsheet or a mind map.

This taxonomy had to consider the physical layout of my kitchen. At the same time, card sort findings indicated that some things should be stored in the refrigerator and frequent use items should be kept at the ready. If this were digital content, for instance, where I’m crafting a taxonomy for describing recipe content, physical space is less of a factor, never mind storage requirements.

Based on these deliberations, I used a mix of general location and descriptive purpose labels as the primary level labels in my sitemap. “Next to the stove,” “in the refrigerator,” and “in the pantry” are the three locations in the first tree.

I used XMind. It’s predominately a mindmapping tool, but I use it to create sitemaps and start taxonomies. Also, it’s a great visual when you’re showing a client the depth and breadth of their site as compared to their competitors’. Just zoom out and they can see the big picture of exactly how much content they have. http://www.xmind.net/m/CU8Z

For this kitchen taxonomy, I ran two rounds of tests. As the results of round one came in, I couldn’t stop myself from restructuring my kitchen along the way and subsequently creating a round two to test the new structure. It happens. Iteration is good. When you’re putting together your UX research plan, be sure to plan and budget for iterations.

In the first round, I focused on those three locations:

Next to the stove
In the refrigerator
In the pantry

In the second round, I broadened the locations:

On the countertop, next to the stove
In the cabinet, next to the stove
In the cabinets, below the stove
In the refrigerator
In the pantry
Near the coffeemaker
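
A tree like the ones above can be kept as plain nested data, which makes it cheap to revise between rounds. The sketch below is a minimal illustration in Python, not the format that Treejack or XMind uses. The top-level labels come from round 2; the second-level groups and the items under them are hypothetical placeholders, as is the `find_paths` helper.

```python
# Round 2 tree as nested dicts. Top-level labels are from the article;
# the groups and items underneath are hypothetical placeholders.
ROUND_2_TREE = {
    "On the countertop, next to the stove": {"Everyday seasonings": ["salt"]},
    "In the cabinet, next to the stove": {"Savory spices": ["dry rub"]},
    "In the cabinets, below the stove": {"Cooking oils": ["sesame oil"]},
    "In the refrigerator": {"Door": ["Sriracha sauce"]},
    "In the pantry": {"Noodles": ["ramen noodles"]},
    "Near the coffeemaker": {"Tea": ["Chinese tea"]},
}


def find_paths(tree, item, trail=()):
    """Return every path (as a tuple of labels) that ends at `item`."""
    paths = []
    for label, children in tree.items():
        here = trail + (label,)
        if isinstance(children, dict):
            # Keep descending through nested groups.
            paths.extend(find_paths(children, item, here))
        elif item in children:
            paths.append(here + (item,))
    return paths


for path in find_paths(ROUND_2_TREE, "dry rub"):
    print(" > ".join(path))
```

An item that comes back with more than one path lives in two places, and an item with no path at all is missing from the tree, so both cases are worth checking before a test goes out.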

The tasks

In both rounds of tree testing, the tasks I gave respondents had similar themes. The tasks were commonly-occurring scenarios in my kitchen whenever we have guests or are preparing for a yummy get-together.

	Round 1	Round 2
Common-use/Training	My sister is visiting from out-of-state and is making fried eggs for breakfast. Where should she look to find salt?	My sister is visiting from out-of-state and is making fried eggs for breakfast. Where should she look to find salt?
Wildcard	It’s breakfast smoothie time! I want chia seeds added in mine. Where would you look to find them?	It’s late and I’m craving noodles after watching Korean dramas. Where can I find ramen noodles?
Spice item	For dinner this Saturday, we’re making BBQ ribs. Help me find the dry rub.	For dinner this Saturday, we’re making BBQ ribs. Help me find the dry rub to season the ribs.
Pantry/Baking	I’m trying to perfect a recipe for a childhood favorite snack, Hong Kong-style egg waffles. For this, I’ll need custard powder. Where would you look to find this?	We’re in a Thai mood for dinner tonight so I’m making coconut rice. Where would you look to find coconut milk? Either canned or boxed is fine.
Coffee and tea	It’s time for a coffee break! Or, if you prefer, time for that afternoon caffeine boost. I’m ready for some fresh brewed coffee. Where would you look to find the coffee beans? I’ll get the water ready.	It’s time for that afternoon caffeine boost. Could you get the Chinese tea out? Any one is fine. I’ll get the water ready.
Ingredient with questionable storage	We’re eating chicken stir-fry for dinner tonight. Do you need hot sauce? Where would you look to find Sriracha sauce?	We’re eating chicken stir-fry for dinner tonight. Do you like to eat spicy? Where would you look to find Sriracha sauce?

The follow-up questions

After each task, I asked some follow-up questions to assess the ease of the task:

Overall, how difficult or easy did you find this task?
Any thoughts on how this experience could be improved?

I use a 7-point rating scale with 1 being very difficult to 7 being very easy. An open-ended question helps capture feedback right away while the task is still fresh in the user’s mind.

The recruit

Hopefully, you have a pool of participants you can send your test out to. You may want to check out Demetrius Madrigal’s Research Logistics, where he outlines how you might want to recruit participants similar to your target audience. Or you may send it around the office to whoever has 5-10 minutes to spare from watching cat videos. For my test, I had embedded it with my card sorting article and posted it across LinkedIn, Twitter, and Facebook.

The wait, the cringe, and the iteration

Once a tree test is running, the fun starts. The results start rolling in and you can start making adjustments for a second round if you want, which I did. You start seeing patterns after three people have gone through your test. Once the test reached the magic number of five participants, I made the first level of the tree broader. I expanded the top level branches to include the cabinet area next to the stove, under the stove, and the counter space next to the stove. I also designated an area for tea and coffee based on the location of the coffeemaker/electric water kettle.

The second tree spreads out into six branches, including counter space, cabinets surrounding the stove, a designated drink station, the refrigerator, and the pantry. http://www.xmind.net/m/NeRk

The satisfaction that comes with people-watching is doubly-true for monitoring test results. How are people reacting to this navigation? What made sense to them? What didn’t? And then, how many more people should I test this with to know if this is a universal concern?

How many people should I test with?

Clients often ask how many people should take the tree test. It depends on how comfortable you are with a higher margin of error. For more on statistics, you should check out Jeff Sauro’s Using Tree-Testing to Test Information Architecture.

At this point, you could potentially go into basic statistics and talk about confidence levels, sample sizes, and margins of error. How many participants would you need to test to be 95% confident that your users can find your content?

Say there are about 100 people who would traverse through my kitchen in its lifetime (population size). Round 1 had 27 complete responses (sample size) and a success rate of 91%. Using the margin of error calculator with a confidence level of 95% (meaning there’s a 95% likelihood that the sample accurately reflects the attitudes of your potential taxonomy users), I determined that the margin of error is 17%. That means that there’s a 95% likelihood that between 74% (91-17) and 108% (91+17) of all the people in my kitchen would actually be able to find stuff in my kitchen. Since a success rate can’t exceed 100%, read that upper bound as 100%.

Round 2 had a sample size of 20 people, of which 95% were able to find stuff in my kitchen successfully. Using the same formula, the margin of error is determined to be 20%. Note that the margin of error here has changed because the sample size is smaller. I can be 95% confident that between 75% (95-20) and 115% (95+20) of people would be able to find stuff in my kitchen (again, capped at 100% in practice). Apparently, it’s a Very Organized Kitchen.

However, the difference between the two wasn’t very substantial. Ideally, I’d test with more people so that the margin of error is smaller.

Factors to consider when determining the size of the margin of error:

Sample size: Use the number of participants who completed the test.
Percentage: Use the worst case scenario (50%) if you want to determine a general level of accuracy. In my example, I used the overall success rate for each round.
Population size: Use the number of people in the group that your sample represents.
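
To make the three factors concrete, here is a rough sketch of one common way to compute a margin of error for a proportion: the normal approximation with a finite population correction. The article does not say which formula its calculator uses, so treat this as an assumption. The function names are illustrative, and the default percentage is the worst case (50%).

```python
from math import sqrt
from statistics import NormalDist


def margin_of_error(sample_size, population_size, proportion=0.5, confidence=0.95):
    """Margin of error for a proportion, with finite population correction.

    Assumes the normal approximation; `proportion` defaults to the
    worst case of 50%.
    """
    z = NormalDist().inv_cdf(0.5 + confidence / 2)  # about 1.96 for 95%
    standard_error = sqrt(proportion * (1 - proportion) / sample_size)
    fpc = sqrt((population_size - sample_size) / (population_size - 1))
    return z * standard_error * fpc


def interval(success_rate, moe):
    """Confidence interval clamped to the valid range of 0 to 1."""
    return max(0.0, success_rate - moe), min(1.0, success_rate + moe)


round_1 = margin_of_error(sample_size=27, population_size=100)
round_2 = margin_of_error(sample_size=20, population_size=100)
print(f"Round 1: +/- {round_1:.1%}, interval {interval(0.91, round_1)}")
print(f"Round 2: +/- {round_2:.1%}, interval {interval(0.95, round_2)}")
```

Under these assumptions the two rounds come out at roughly 16% and 20%, close to the figures quoted above. Clamping the interval keeps the upper bound from running past 100%.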

What KPIs should I look at?

Tree testing generates a few key performance indicators to keep in mind:

Overall score
Task success rate
Task directness rate
Time taken per task

Overall score

The overall score is a weighted average of task success and directness. Treejack founder Dave O’Brien says that an aggregate score of 8 or above for a task means that no changes need to be made (source). Lower scores indicate that it is an area that requires attention. When you start analyzing your tree test results, be sure to focus on the lower scores.

Both rounds of testing tell me that I need to focus on where to put the dry rub spices, custard powder, and that bag of chia seeds. You’ll read about that later.

Success

The success score refers to the percentage of participants who selected a correct answer, regardless of whether or not they had to jump around the tree a few times before doing so. A success score of around 80% or more is considered a good score for a task, says Optimal Workshop.

There is an overall success score across all the tasks in each round as well as for each task. Round 1 scored 91% overall and Round 2 fared a bit better (95%). Good job, Grace, good job. But the Asian in me asks why it’s not 100%. How can we make it 100%?

Directness

The directness score is the percentage of participants who did not backtrack at all when selecting an answer, whether their answer was correct or incorrect. It attempts to measure how confident participants were in selecting their answer, though it’s important to not assume too much about what participants were thinking.

Between round 1 and round 2, directness improved slightly: 58% of participants chose an answer without backtracking in round 2, compared to 54% in round 1.
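
As a rough illustration of how the success and directness definitions differ, the sketch below computes both from a handful of made-up attempts at a single task. The `Attempt` record, the path notation, and the sample data are all hypothetical; this is not how Treejack stores or scores results.

```python
from dataclasses import dataclass


@dataclass
class Attempt:
    clicks: list  # full path of each node the participant opened, in order
    answer: str   # node the participant finally selected


def backtracked(clicks):
    """True if any click was not a step deeper into the previous node."""
    return any(
        not current.startswith(previous + "/")
        for previous, current in zip(clicks, clicks[1:])
    )


def success_rate(attempts, correct_answers):
    """Share of participants whose final answer was correct, direct or not."""
    return sum(a.answer in correct_answers for a in attempts) / len(attempts)


def directness_rate(attempts):
    """Share of participants who never backtracked, right or wrong."""
    return sum(not backtracked(a.clicks) for a in attempts) / len(attempts)


NUTS = "In the pantry/Nuts and seeds"  # hypothetical correct answer
attempts = [
    Attempt(["In the pantry", NUTS], NUTS),                         # direct, correct
    Attempt(["In the refrigerator", "In the pantry", NUTS], NUTS),  # backtracked, correct
    Attempt(["In the refrigerator"], "In the refrigerator"),        # direct, wrong
    Attempt(["Next to the stove", "In the pantry", NUTS], NUTS),    # backtracked, correct
]

print(f"Success:    {success_rate(attempts, {NUTS}):.0%}")
print(f"Directness: {directness_rate(attempts):.0%}")
```

The third attempt shows why the two numbers should be read together: that participant counts toward directness even though the answer was wrong.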

Time taken

Time taken is a box plot showing the time taken for participants to complete the entire survey (in minutes) as well as per task (in seconds). In the kitchen, time is of the essence when you have stuff on the stove. My mom doesn’t measure out ingredients ahead of time; she measures by eye, something I’ve also noticed my father, in-laws, and friends do.

I set the first task about finding salt to be the training question, an easy one so participants would be able to see how the test is run and know what to expect for the other questions.

Analyzing the results: focusing on the problem areas

Overall, both tree tests showed that people are in fact able to navigate my kitchen pretty well. Sure, there were a few hiccups *ahem, chia seeds, dry rub, custard powder, coconut milk*, but almost everyone was able to complete their tasks successfully.

Based on the overall score, I know that I should focus on any task that scored lower than 8, so that leaves us with chia seeds, dry rub, custard powder, and coconut milk. I know that directness is lacking, so how can I improve on that?

Looking for chia seeds

Finding chia seeds for my breakfast smoothie was probably not the best task to use.

Not many people know what chia seeds are used for.
There are many things that chia seeds can be used in (baked goods and beverages, to name two).
How would I manage user expectations for chia seeds as a food or nut item, not associated with drinks and beverages?

In fact, more than half of the participants (53%) indicated that finding chia seeds was a fairly difficult to very difficult task.

If you look at the pietree, you’ll see that people eventually looked everywhere they could. The green lines indicate the expected paths. The thicker lines indicate the popularity of that path. More than three-quarters searched the pantry first; 19% peeked in the refrigerator. Eventually I decided to place this in the pantry with flaxmeal, seeds, and nuts, which I also mix into breads and muffins.

Getting the dry rub for the ribs

What’s a summer without a BBQ? Dry rub was a task that I kept in both rounds. Could you tell that I was having trouble trying to figure out where this belonged?

Refer to the pietree below. The assumed correct path is the one highlighted in green. About 88% of participants went to the pantry first, before looking around the stove.

In round two, I designated more areas around the stove for spices (for cooking savory dishes). While not the first place they went to (55% went to the pantry first), almost everyone (90%) looked in the cabinet next to the stove during the task. Even though more participants were able to find where it was stored, it took them 70% more time to find it.

In the end, I’ve opted to keep all savory spices next to the stove. Additional contextual inquiry will help inform how that decision will hold up. Cue observing in-laws putting groceries away in the kitchen.

Is it a flour? Is it a starch? No, it’s custard powder!

Let’s talk about custard powder! Lots of participants commented that they had no idea what custard powder is (note that all the test participants are from the United States). According to Wikipedia, it is popular in the United Kingdom for making custards—also known as pudding—without eggs.

Some indicated that they’d consider it a flour and looked for it near the flour section. Some thought that it would be classified with other “powders.” To be honest, I wasn’t sure either and left this for you to best tell me where to store it. Eventually, I kept it with the rest of the baking powders and I haven’t had an issue with finding it yet.

So what’s the verdict?

Every taxonomy is subject to stakeholder drama and idiosyncrasies. In a kitchen, this is dictated by the physical layout and its workflows and even colloquial nuances in understanding terminology. What evolves as a stable enough navigation for my kitchen won’t necessarily work for yours.

Here’s what you can get out of this tree test:

Assess the existing navigation. Be sure to start first by tree testing your navigation. You’ll find out where your trouble areas are and how it compares against your proposed sitemaps.
Oh, and when deciding on the tasks, be sure to focus on common tasks and not so much on the outliers and special cases.
Opt for more overlapping tasks if possible and especially if you’re planning on iterations. Having the same tasks that appear in both iterations helps set benchmarks. In the end, I only had three overlapping tasks: salt, dry rub, and sriracha sauce.

No one had trouble at all with finding salt, sriracha sauce, and coffee, so it’s not worth the time to fix what’s not broken.

At the same time, it’s important to identify the problem areas. Dry rub, coconut milk, and custard powder were definitely outliers and worth digging deeply into.
Minimize variables between iterations. Try not to change the wording too much. You want to be sure that you can pinpoint how much effect the change in the navigation structure has and not worry about how the wording has affected the task. Dry rub was one where I introduced more than one variable. I changed the wording in the task. I changed where it lived in the taxonomy. How would I know which variable contributed to the higher success?

Epilogue

I took some time after the tree test ended and implemented some of the changes that folks recommended. I created a “drink and beverage” station with cups and coffeemaker and hot water kettle nearby on the kitchen counter. I rearranged where we kept plates and rice bowls. My husband and son had a quick introduction to the new arrangement before the in-laws came back.

Then the in-laws returned from their six-month sabbatical, and I’ve had a fair amount of time (about five months since I conducted this tree test) to observe their interactions in the kitchen. My parents have also been over to conjure up more meals since I’ve executed the new taxonomy.

The good news?

Spices next to the stove are easier to spot.
Sriracha, XO sauce, and hoisin sauce now have a dedicated spot in the refrigerator door.
New dedicated section was added in the pantry for nuts, including chia seeds.
The “flours and starches” area now has separate sections for wheat flour, gluten free flours, starches, yeast, and powders (including custard powder)

The bad news?

Sugar moves back and forth between the small jar next to the stove and the pantry, where there’s a whole section devoted to “sugar and honey.”
Sesame seeds are hard to find. It’s a topping really, not cooked as a grain or nut.
Because they’re used for the same purpose (thickening), sweet potato starch, cornstarch, and tapioca starch could be combined in one container. I shook my head so hard on this one.

Now that this taxonomy is working for most of my users, it’s time to turn my attention to governance—maintaining the integrity of this taxonomy and creating rules for adding new taxonomy concepts and removing obsolete or outdated ones.

This series started out as a justification of why taxonomy should be an essential part of a website redesign, and for this instance, of a kitchen reorganization. If you’re wondering how we got here, check out what I’ve been doing in my kitchen.

References

Dave O’Brien “Tree Testing”
Dave O’Brien “Tree testing in the design process — Part 1: The research phase”
Dave O’Brien “Tree testing for websites” – a comprehensive guide
Jeff Sauro “Using Tree-Testing to Test Information Architecture”
Kirbie’s Cravings. Hong Kong Egg Waffles

The post Could You Hand Me the Dry Rub Please? appeared first on Boxes and Arrows.

Card Sorting a Kitchen Taxonomy

Grace G Lau — Tue, 14 Jun 2016 11:34:07 +0000

This is the fourth in a series of real-life examples of taxonomies found in my kitchen. Part 4 of “Taxonomy of Spices and Pantries” looks at how card sorting studies can inform a taxonomy.

Building the business case for taxonomy
Planning a taxonomy
The many facets of taxonomy
Card sorting a kitchen taxonomy
Tree testing
Taxonomy governance
Best practices of enterprise taxonomies

The son raiding the refrigerator. Credit: Grace G Lau

I started this series as a way to share my journey as I organize my kitchen. At this halfway mark, I’m actually quite proud that I’ve made it this far and that you’re here to mark the occasion, so thank you for reading! In the next half of this series, I finally get to start showing you the 80% of the time and effort that goes into taxonomy development.

The business value proposition of an enterprise taxonomy/metadata can only be measured by analytics. How much time is wasted in searching and not finding information? How much is that in lost revenue?

We started the discovery process for this taxonomy by understanding the project scope and domain. Who are the users? What content is being included or not? What is the primary purpose for this taxonomy?

While a final taxonomy is a list of sorts, the sign of a solid, sustainable taxonomy is in its ability to be flexible and scalable. In what ways can this taxonomy be applied in other content or information systems? How do users think about the content and find what they need?

Always, always consider the various applications of this taxonomy. If not applied consistently and coherently, developing a taxonomy is a wasted effort from an information architecture perspective.

A pantry with no taxonomy in place.

Thus, going back to users to validate and confirm assumptions is a fundamental part of the recipe to keep in mind, because all this planning and re-organizing is in vain if my in-laws come back in six months and the kitchen reverts to its original state.

In this next piece, I walk through my card sort analysis to further define my next steps.

A taxonomy crafted from the business perspective does not last.

As the taxonomist, I could arbitrarily create groupings that make sense only to me and be done. Who cares if my in-laws can’t find the corn starch? They’ll just have to ask me and over time they’ll figure it out, right? No!

When we moved the first time, this was exactly what happened. I was told to organize the kitchen and I did. Over time, the kitchen reverted to things being stored based on convenience and portability. It wasn’t bad, but it wasn’t efficient (cue three containers of baking soda).

Designing only by the business objectives—without user research—is the old way of designing a web site. The lack of research and affirmation from other stakeholders demonstrates how a new look and feel doesn’t make a new system efficient; you must consult your users.

Know your content. Create a content inventory.

Going through everything in my pantry and spices cabinet, I pulled out every single item and captured the following:

Product name (in English)
Product name (in Chinese)
Quantity. Eventually, this column became tedious and I stopped trying to track this.
Category/Purpose. In the early stages, I used this to capture the original organizing mechanism used.

Unless users of the taxonomy were interested in brand products and tracking cost per volume, I wasn’t about to spend the time to capture the additional properties for each item:

When or where it was purchased
How much each item contained (volume/quantity)
The product brand name

However, when you’re building your content inventory, this type of additional metadata may be useful information. To determine how much information to capture, go back to your users. Is brand an important facet that your users reference? Is hot sauce referred to as a generic “hot sauce” or the specific brand “Sriracha” or “Tabasco”?

Screenshot of early stage of content inventory. October 2014

At this early stage, I started assigning items to a high-level category based on the item’s current position in my storage: Flours, sugars, essences and extracts, sauces, vinegars, oils, beans, noodles. Their current placement also implied underlying organization systems.

Some items were placed in multiple places (categories). For instance, my father-in-law kept his stash of spices separate from the rest so that he could quickly pull the spices that he frequently needed. This is how we ended up with three bags of baking soda: one my father-in-law kept, one that I kept, and the 5 lbs. of baking soda we bought from Costco.

Sometimes items may be available under more than one category. This is called a polyhierarchy, and it happens when an item has more than one hierarchical parent, whether through a generic-specific, instance, or whole-part relationship. This is perfectly fine in a digital taxonomy.

In a taxonomy of physical items, like the spices one we’re discussing, creating polyhierarchies could potentially undermine the navigation scheme and create confusion. Cinnamon, a prime example, can be used in both savory and sweet recipes. Going deeper, curry powder could be classified as an instance (example) of Indian spice as well as a main ingredient in many other Asian curry dishes. If we consider the cultural, geographical and political implications of classifying spices, you’ll find another polyhierarchy. To handle this in a physical taxonomy, it’s best to conduct a tree (navigation) test and see how users tend to associate an item and let your users determine the best placement based on usage.

On the other hand, although a digital taxonomy for spices doesn’t have the same restrictions as a physical one, you probably should create rules for when that happens (ahem, that’d be documented in a taxonomy governance guide). There should be a general overarching organization scheme that is logical and easy for your users to follow. If there are too many polyhierarchies, the taxonomy becomes too difficult to follow. Having a clearly defined structure is important so this is another reminder to keep it simple (K.I.S.S.).
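A governance rule like that can be checked mechanically. The following is a rough sketch, assuming the taxonomy is stored as a mapping from each item to its set of parent categories; the category names and the idea of tracking a "share" of polyhierarchical items are assumptions for illustration, not part of any standard.

```python
from collections import defaultdict

# item -> set of parent categories (illustrative assignments)
parents = defaultdict(set)


def assign(item, category):
    """Place an item under a category; an item may have several parents."""
    parents[item].add(category)


assign("cinnamon", "savory spices")
assign("cinnamon", "sweet spices")
assign("curry powder", "Indian spices")
assign("table salt", "salts")


def polyhierarchical_items(parents):
    """Items that live under more than one parent category."""
    return sorted(item for item, cats in parents.items() if len(cats) > 1)


def polyhierarchy_share(parents):
    """Fraction of all items that have more than one parent."""
    if not parents:
        return 0.0
    return len(polyhierarchical_items(parents)) / len(parents)


print(polyhierarchical_items(parents))  # ['cinnamon']
print(round(polyhierarchy_share(parents), 2))  # 0.33
```

What share counts as "too many" is a judgment call for the governance guide; the sketch only makes the number visible.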

Validate your assumptions using a representative card sort.

My in-laws were out of the country, so I couldn’t get their input with a manual card sort. Instead, I decided to see how other users think about their kitchen organization using a virtual, unmoderated card sort. I wanted to find out:

How did other people/users organize their kitchen?
What labels would they use to name the groups?
What things would they consider to be part of their spices/pantry?
What other considerations would they have for their spices/pantry?

The study was set as an open sort because I wanted to know what labels people used to call those broader categories. I also asked a set of open-ended questions to gather additional information about how participants thought about their kitchen.

I narrowed down the card sort to 60-70 items. If I were conducting a moderated, in-person card sort with physical index cards, 200 cards would not have been too daunting. Meanwhile, 200 cards in an online card sort would have felt tedious and never-ending for a card sort participant.

I also focused on selecting cards that represented a consistent level of detail. Salt wasn’t just salt. It was table salt or sea salt. Rice was jasmine rice or brown rice. Vinegar was rice vinegar or white vinegar.

When you’re setting up your own card sort, be sure to consider your participant’s experience. If I’d had more time, I would have also included pictures and short descriptions of each item. If you’re debating about what to include in your card sort, see Donna Spencer’s Card Sorting book and her article.

64 cards. 288 categories. 36 users.

Beginning of a card sort.

The card sort ran for about 2½ weeks in February 2016. I shared it across Twitter, Facebook, and LinkedIn. It reached 81 people, of whom 44% (36) completed the sort of 64 cards and the questionnaire at the end. I offered one pound of assorted chocolates as an incentive; my son’s school was having a fundraising event.

I asked two open-ended questions at the beginning and five open-ended questions at the end of the exercise. I don’t know what I was thinking, but 64 cards x 288 categories x 36 participants x 7 questions = 4,644,864 data points.

Pre: What are the top three tasks that you engage in in the kitchen?
Pre: What is your primary role in the kitchen?
Post: What was your overall approach to organizing the cards?
Post: Are you happy with the overall groups that were created?
Post: Were there any items that are missing that weren’t included?
Post: Were there any items that you would have called by a different name that you recall?
Post: Do you have any additional feedback or comments that you’d like to include?

Screenshot of my card sort showing all 64 cards in groupings.

What the results told me about myself

The results told me that I had missed whole groups of pantry items: teas, canned goods, cereals, and granola bars.

It made me realize that I was not as objective as I thought.

One participant added:

I’m surprised no canned goods made the list. I have found it interesting how people group those things, as well. I’ve seen some give them a specific and separate area (that’s me!), some people keep them with all wet ingredients, and some people may even keep certain canned/wet things with seasonings.

Incidentally, I do have a canned section of my pantry, but it has only tuna fish and tomato paste. Same goes for cereals: We haven’t really eaten cereal until recently. My son came home from kindergarten one day asking for Lucky Charms, so we bought a box and then two more.

Quite a few people brought up sauces that I missed, especially Sriracha and Tabasco sauce, because I usually keep those in the refrigerator. Then I noticed that I keep hoisin sauce and oyster sauce in the cabinet close to the stove. Then I realized that I have inconsistent food storage management practices.

At the same time, the results told me that I had made a lot of assumptions about the items I use and the people who use them. In the beginning, I had dismissed collecting brand information, thinking that it was all extraneous information. In fact, brand actually plays a big role in how we referred to things. Hot sauce is not just “hot sauce.” My husband prefers Sriracha sauce for rice and Tabasco for eggs. A certain brand’s black vinegar results in better pig trotters than others, which is why recipes sometimes call out certain brands of ingredients. As a result of this finding, I should include brand as part of my taxonomy scope. See that there? That’s scope creep.

Although coffee and tea were part of the original list, I thought I had a solid handle on where they should go so I omitted those from the sort. I let my personal experience take over and limited the card sort that way. As you plan your own card sort, consider these universal truths:

Check your world views at the door. This is a learning opportunity. Don’t let assumptions stop you from learning how others see the world.
Don’t be a completionist. With practice, you’ll get a better handle on how to plan, execute, and analyze card sorts. Summarize key findings and share with your team as soon as possible. *cough* (Don’t wait months and lose your stakeholders’ trust.)
Rinse and repeat. If your categories don’t work or are too broad, modify or test with a subset of cards and try again.

What the results told me about the items

Thirty-six participants grouped and labeled the categories in 288 different configurations.

Going through the card sort results, I attempted to “standardize” the category names. I lumped together the categories that didn’t have unique labels or unique cards in the groupings. From these, I tallied up the organization schemes that the standardized categories followed. These revealed other useful information to factor in the taxonomy.

Organization scheme	Examples
Function	Activator, additive flavoring, baking, leavening, condiments, sweeteners
Frequency of use	Everyday, advanced, daily seasonings
Cuisine type	Asian, Italian
Type	Sugars, extracts, herbs, cooking liquids, vinegars, staples, grains, flours, sauces, salts
Storage location	Freezer, near the stove
Taste	Sweet things, cookies, savory

For instance, “baking” as a category label had an overwhelming response. Of 36 card sort participants, there were 28 different configurations of what “baking” as a category should include. I could potentially break this out into its own card sort and see how folks group this. Here are the labels for baking:

baking
baking ingredients
baking liquids
baking needs
baking spices
baking stuff
baking supplies
baking sweeteners
baking powders
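The label-lumping step can be sketched in a few lines. This is a simplified illustration that standardizes labels only (it ignores which cards were in each group), and the `STANDARD_LABELS` mapping is a hypothetical hand-built table, not the one I actually used.

```python
import re
from collections import Counter

# Hypothetical mapping from raw participant labels to a standard label.
STANDARD_LABELS = {
    "baking ingredients": "baking",
    "baking needs": "baking",
    "baking stuff": "baking",
    "baking supplies": "baking",
}


def normalize(label):
    """Lowercase, collapse whitespace, then map to a standard label if known."""
    cleaned = re.sub(r"\s+", " ", label.strip().lower())
    return STANDARD_LABELS.get(cleaned, cleaned)


raw_labels = ["Baking", "baking stuff", " Baking  Supplies ", "baking spices", "baking needs"]
tally = Counter(normalize(label) for label in raw_labels)
print(tally.most_common())  # [('baking', 4), ('baking spices', 1)]
```

Labels such as "baking spices" are deliberately left unmapped here, since deciding whether they are a distinct category is exactly the judgment the analyst has to make.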

Category names that spoke to an item’s function and the items that were grouped together in those categories turned out to be a subtle teaching opportunity. For example, when I investigated the category name “leavening,” I learned why baking soda and baking powder are added in baking recipes.

activators
additive flavors
condiments
flavouring
flavor enhancers (baking and beverage)
thickeners
thickening agents
starch and thickening stuff
leavening
sweeteners
baking sweeteners

The fewest unique categorizations were for curry powder, ground ginger, and white sugar. Across 36 participants, each of these cards was sorted into 15 unique categories. Each category may have a different composition or label. Compare this with pepper oil, cooking wine, and flaxseed meal, which each had more than 25 unique categories.

Having fewer unique categories implies that participants have more agreement on the label of the category. More categories may suggest more varied interpretations of the item and the categories in which it may live.
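Counting unique categories per card is easy to reproduce from raw sort data. Below is a minimal sketch, assuming each participant's sort is a mapping from category label to cards and that categories are compared by label only; the sample sorts are made up for illustration and are not the study results.

```python
from collections import defaultdict

# One dict per participant: category label -> cards (illustrative data).
sorts = [
    {"spices": ["curry powder", "ground ginger"], "oils": ["pepper oil"]},
    {"spices": ["curry powder", "ground ginger"], "asian sauces": ["pepper oil"]},
    {"seasonings": ["curry powder"], "spices": ["ground ginger"], "condiments": ["pepper oil"]},
]


def unique_categories_per_card(sorts):
    """Count how many distinct category labels each card was placed under."""
    seen = defaultdict(set)
    for sort in sorts:
        for label, cards in sort.items():
            for card in cards:
                seen[card].add(label.strip().lower())
    return {card: len(labels) for card, labels in seen.items()}


counts = unique_categories_per_card(sorts)
print(counts["ground ginger"])  # 1
print(counts["pepper oil"])     # 3
```

A low count suggests agreement on where a card belongs; a high count flags a card worth discussing in a moderated session.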

Side-by-side view of the cards with the fewest and the most unique categories.

Looking through the groupings, there were a few other groups that had some element of temperature. One person placed flaxseed meal in the “freezer” which I had not done. Other things that were grouped for the refrigerator category included active dry yeast, fish sauce, soy sauce, and cooking wine. My parents never stored soy sauce in the refrigerator, and so I followed suit. Maybe there should be a mini-fridge for all these sauces next to the stove? Following this, I intend to include “storage handling” as another property to track in my taxonomy.

And of course there were distinctions that people brought up around herbs—fresh, dried, and ground—that weren’t called out in the cards.

There were category labels that spoke to where it is stored:

cooking starches near the stove
cooking liquids—refrigerable
always on the counter

A couple participants indicated in the post-exercise survey that their current organization scheme is dictated by the amount of available space and size and weight of the item’s container.

As a category, “condiments” had a wide range of items, from 2 to 17, and the items didn’t overlap much. I had to look up the word “condiments” to make sure I understood what it meant. My mental model limited condiments to ketchup, salt, and pepper: things that people add after food has been prepared. Apparently, condiments are additives used during or after preparation to enhance flavor, and the category could then include soy sauce and Sriracha as well.

When you’re building a taxonomy for a client… well, in an ideal process, term research is an activity that should come before user research to evaluate taxonomy. However, user research is how you learn the way users, rather than experts, relate to terms. Ultimately, you should defer to terms that your users are using.

There were categories that spoke to cuisine:

Asian
Asian condiments
Italian food

Of course, the number of cuisine categories is limited to the cards in the exercise, which was based on my own cooking habits and my kitchen inventory.

What the results told me about other people

The results also revealed the different approaches that participants had toward their kitchens and their cooking.

One participant named a category “Advanced spices.” Was “advanced” referring to a higher level of understanding of cooking? On what basis? Was it used in special dishes for special occasions as opposed to an everyday home-cooked meal? Was it referring to a complex flavor that is not usually encountered in home cooking? As you can tell, I could only get so far with remote testing and no way to ask questions.

There were category labels that spoke to how frequently something was used:

everyday
for that one recipe
daily seasonings

In a moderated card sort, this would be a great opportunity to ask for clarification around “everyday” and “daily.” Does this mean there is a basket of rarely used spices tucked away somewhere?

Nevertheless, no matter what the category was called, there were some clear groupings that emerged. For this, I used the best merge method, an industry standard for card sort analysis based on the frequency of two cards that occur together, and determined the following categories:

Flours
Noodles and pastas
Rice
Sugars
Thickeners
Extracts
Activators
Sauces
Oils
Vinegars
Salts and peppers
Herbs
Spices
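The pairwise counting that this kind of analysis builds on can be sketched as follows. This is a simplified illustration of co-occurrence counting only, not Optimal Workshop's implementation of the best merge method; the function names and sample sorts are assumptions.

```python
from collections import Counter
from itertools import combinations

# One list per participant; each set is a group of cards (illustrative data).
sorts = [
    [{"jasmine rice", "brown rice"}, {"rice vinegar", "white vinegar"}],
    [{"jasmine rice", "brown rice", "rice vinegar"}, {"white vinegar"}],
    [{"jasmine rice", "brown rice"}, {"rice vinegar", "white vinegar"}],
]


def pair_counts(sorts):
    """Count how many participants placed each pair of cards in the same group."""
    counts = Counter()
    for groups in sorts:
        for group in groups:
            for a, b in combinations(sorted(group), 2):
                counts[(a, b)] += 1
    return counts


def agreement(counts, a, b, n_participants):
    """Share of participants who grouped cards a and b together."""
    return counts[tuple(sorted((a, b)))] / n_participants


counts = pair_counts(sorts)
print(counts.most_common(1))  # [(('brown rice', 'jasmine rice'), 3)]
print(agreement(counts, "jasmine rice", "brown rice", len(sorts)))  # 1.0
```

Pairs with high agreement are candidates for the same category; where to cut off the merging is still a judgment call.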

Where you draw the line before smaller, unique categories turn into larger, discernible categories is subjective. Sure, the grocery store has a huge section for produce, but even within the produce section there are lines between root vegetables; green, leafy vegetables; and fresh herbs. But this distinction varies by the situation.

If you follow the diagram below, you’ll notice that the flours eventually merge with the noodles and pastas; vinegars, sauces, and oils merge into cooking liquids. When do you stop combining categories? Use your best judgement at this point.

Screenshot of Best Merge Method from Optimal Workshop, Feb 2016

Some participants mentioned that they’d like to see more granular groupings of herbs. There are definitely more items that are missing from the set of 64 cards. In fact, I’d expect that the average kitchen has at least four times the amount of stuff listed here.

As I mentioned before, this particular card sort is based on the contents of my personal pantry and spices collection. Many participants mentioned that there’s hella more in their own pantry, and I say, of course! I’m only asking for 20 minutes of your time and so keeping the list to a select 60-70 would help reduce fatigue.

Even for an enterprise project, you may want to consider limiting the number of cards in a remote (unmoderated) card sort. In a moderated card sort with physical cards, however, up to 200 cards is manageable in individual card sorts (not team card sorts) if the cards cover broader topics that are easier to group.

As you work through your card sort results, you may find that your understanding of the domain is not enough. Your findings can actually shake up your prior experience. In doing this card sort, I am humbled by how little I knew.

What do you think is the most useful purpose for a kitchen taxonomy?

Being able to find things was most important. Here is a breakdown of the importance for the purpose of the kitchen taxonomy.

Piechart showing participant response to kitchen taxonomy purpose

30.6% To find foodstuff when I need it
25% To bridge content across recipes, pantries, and chefs
19.4% To help with putting foodstuff away in consistent locations
13.9% To create a common language across its users
11.1% To support navigation around the kitchen

What are the top three tasks that you engage in the kitchen?

This was an open-ended question and it provided a nice synopsis of what goes on in one’s kitchen. Here’s a quick list of things that people do in their kitchen:

Cooking
Cleaning
Finding food
Conversation
Getting wine
Charging mobile devices
Making coffee
Organizing
Sex
Watching TV
Exercise

What is your primary role in the kitchen?

This was another open-ended question, and the responses quickly illustrate how one’s role in the kitchen may also dictate one’s importance in the household.

Consumer, cook, chef, boss, Goddess of Food and Fire
The Grand Overseer. The Key Master and Gate Keeper, both. GOD.
Queen / Supreme Authority
dishwasher
forager

In developing the questions for your own card sort, think about how the answers to your survey questions might affect the final taxonomy design. The categories created have to be relatable to all users of the taxonomy, which not only includes the people who cook but also those who put things away.

Things to consider

When I first looked at the results three months ago, I cringed. I fretted over the questions that I asked, that I should have asked. I agonized over what I wanted to find out, what I could have found out.

I was stuck on how this analysis could have turned out. In the post-test survey, I could have, and should have, asked some quantitative questions, such as participants’ level of satisfaction with the groupings.

At the end, I realized that it was enough. What matters is that I learned a lot about who I am, what I was trying to do, who the users are, and what was important to them. And I will learn more about them as I continue my process of validating this kitchen taxonomy. Meanwhile, I will continue to develop the terms in my taxonomy.

In the end, everyone’s kitchen taxonomy is subject to their own models of understanding and personal approach towards food. Your kitchen taxonomy is not my kitchen taxonomy, but I will be asking you to find things in mine. Block party!

I have an upcoming milestone: My in-laws are coming back from their half-year in China next month. I’ve set up a tree test to evaluate the taxonomy that I have so far (closes June 30, 2016). My next post will walk through tree testing the kitchen taxonomy. Follow me on Twitter @lauggh to get the latest update!

The post Card Sorting a Kitchen Taxonomy appeared first on Boxes and Arrows.

The Many Facets of Taxonomy

Grace G Lau — Tue, 02 Feb 2016 08:00:21 +0000

This is the third in a series that has become real-life examples of taxonomies found in my kitchen. Part 3 of “Taxonomy of Spices and Pantries” looks at where and how facets can be used as multiple categories for content.

Building the business case for taxonomy
Planning a taxonomy
The many facets of taxonomy
Card sorting a kitchen taxonomy
Tree testing
Taxonomy governance
Best practices of enterprise taxonomies

Using my disorganized kitchen as an analogy, I outlined in part 1 the business reasons why a kitchen redesign needed to focus on taxonomy. I’ve moved often and content migration gets pretty ugly in the pantry. After a while, content creators are quick to stuff things into the nearest crammable crevice (until we move again and the IA is called upon to reorganize).

In part 2, I started planning and outlining the scope of this kitchen taxonomy project. Who are its users and core stakeholders? How do they move around the kitchen? What content in this domain would be covered in this taxonomy and where do we draw the line?

However, a simple list of pantry and spice categories is not enough to demonstrate the potential of taxonomies. A neatly organized spice drawer doesn’t represent a sound taxonomy unless there lies some underlying understanding of how the spices are used and in what context.

Moreover, taxonomies have many uses, and creating a retrieval scheme is just the beginning. For this, my kitchen pantry analogy expands to look at recipes as real-world use cases to understand the relationships of spices with one another and other facets as we explore how that content is used and referenced.

Recipes can be used to understand how spices are used. Facets that describe content objectively reveal how users look for content.

Bridging content across silos

With the holidays wrapped up, my son back in school, and packed school lunches back in the daily routine, I have to re-assess the existing kitchen taxonomies as I look through my recipes to figure out staples for lunch and dinner: Are we going paleo, or are rice and noodles back on the list? Am I cooking in the moment, or am I premeditating with slow-cooked meals?

I collect recipes in two main places: Evernote and a 3-ring binder. Recipes come from everywhere: recipes from cookbooks borrowed from the public library, recipes from cookbooks that I’ve purchased digitally or in print, recipes printed from notable food blogs and recipe databases, and even recipes from the plastic bag that holds the chocolate chips.

It usually starts off as someone else’s recipe printed from the web. I print out the ones I intend to try. Rather than simply saving them in Evernote (or in Pinterest for that matter), printing them out allows me to include it for sure during some meal and make adaptations. Trying to update Evernote in the middle of cooking makes for a rather grimy smartphone. Printed, the recipe is protected from kitchen splatters by sheet protectors. Sticky notes annotate what ingredients were substituted or should be tried instead next time, whether it had a favorable reception and merited an encore, or that the recipe had been attempted and should never appear again on the table (pig hock noodle soup, I remember you). This compilation of sticky notes documenting my trials turns the recipe into my own. Then I copy, add my notes in Evernote, and reprint the reworked recipe for the binder.

When you start thinking about your taxonomy, you should keep in mind that this is an opportunity to build a consensus across your content silos. See where content is being created and how it is being used. A kitchen example of a department silo is everyone buying their own container of baking soda and storing it on their personal shelf. Apparently, baking soda has at least six different acceptable names in Chinese. When my father-in-law couldn’t find “the one,” he went and bought another—when we already had a 5 lb. bag of baking soda from Costco.

Pages from my recipe binder for pancakes and soy-braised chicken. Credit: Grace G Lau.

Organizing in hierarchies

The thing about print is that there is only one way to organize content. Hierarchies dictate that content is categorized into fixed groups and mutually exclusive sub-groups based on a single facet.

Imagine that I organize my recipes instead as breakfast, lunch, dinner, and dessert. Does that mean I can only have noodles for lunch and not dinner? With a hierarchy structured by meal time, I am limited.

Cookbooks perpetuate this way of thinking. Take a look at how Paleo Comfort Foods chefs Julie and Charles Mayfield organize their recipes in the table of contents:

Starters and snacks
Sauces and staples
Soups and salads
On the side
Main dishes
Desserts

Although one-dimensional, hierarchical taxonomies exist to create structure and provide focus.

As the binder grows to include more recipes, these groupings help provide organization and avoid a massive dump of recipes at the back of the binder:

Soups
Noodles
Entrées
Appetizers
Sweets

At the same time, when compiling a meal plan for the week, …er, day, flipping through a section of entrées wouldn’t divert me to bake madeleines for a whole afternoon.

What’s left of the orange-, pandan-, and chocolate-flavored madeleines. Of course they disappeared after I took the picture. Credit: Grace G Lau

Organizing using facets

In print, facets are the alphabetical index at the back of the binder. For a hardcopy cookbook, it’s definitive: Page numbers won’t change, recipes won’t be added until a second edition, and the ways you can understand the recipes are limited. The table of contents and the index of facets at the end of the book are set. Imagine the time and effort to maintain an index for a constantly changing recipe binder.

Organizing recipes by facets instead of a singular hierarchy is an opportunity to discover many dimensions of understanding recipes and the uses thereof. Identifying the various facets allows us to use more than one taxonomy at a time in the system.

Thus, a faceted taxonomy for content saved digitally has a better chance of survival. It is scalable, flexible, and changeable at a moment’s notice. Facets used to classify content are not just used for navigation (in a table of contents) or a curated list of key terms (in an index). Facets can be used as search filters, as an interactive label, or as metadata embedded in the content.
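To make the search-filter use concrete, here is a minimal sketch of facets stored as metadata on each recipe. The facet names (`meal`, `dish_type`), their values, and the `filter_by_facets` helper are assumptions for illustration; unlike a binder tab, each facet can hold several values at once.

```python
# Each recipe carries multi-valued facets as metadata (illustrative values).
recipes = [
    {"title": "Pig hock noodle soup", "meal": {"lunch", "dinner"}, "dish_type": {"noodles", "soup"}},
    {"title": "Pancakes", "meal": {"breakfast"}, "dish_type": {"entree"}},
    {"title": "Soy-braised chicken", "meal": {"dinner"}, "dish_type": {"entree"}},
    {"title": "Madeleines", "meal": {"dessert"}, "dish_type": {"sweets"}},
]


def filter_by_facets(recipes, **selected):
    """Return titles of recipes that match every selected facet value."""
    def matches(recipe):
        return all(value in recipe.get(facet, set()) for facet, value in selected.items())
    return [recipe["title"] for recipe in recipes if matches(recipe)]


print(filter_by_facets(recipes, meal="dinner"))
# ['Pig hock noodle soup', 'Soy-braised chicken']
print(filter_by_facets(recipes, meal="lunch", dish_type="noodles"))
# ['Pig hock noodle soup']
```

Because the noodle soup is tagged for both lunch and dinner, it surfaces under either filter, which a single-facet hierarchy cannot do.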

NYT Cooking features a faceted search to surface recipes. Thank you for not mixing “Vegetarian” as a cuisine type. Credit: New York Times Cooking, January 12, 2016

Deciding on facets

But how will I determine which facets to use? By taking content and breaking it down into its smaller components, I would be able to describe the recipe enough to find it again.

Take a recipe for Phở Bò (Vietnamese beef noodle soup). Pho can be classified as a noodle dish and a soup. It has a cultural origin. It can be cooked stovetop for ten hours, slow-cooked for eight hours, or pressure-cooked for one hour. It has a core mix of spices to create that unique flavor.

A quick web search pulls up multiple variations of the basic pho recipe. One version from Jaden Hair’s Steamy Kitchen has a somewhat different mix of spices from another version listed in Michelle Tam and Henry Fong’s Nom Nom Paleo: Food for Humans.

With so many duplicate recipes of the same dish, how do I determine the one definitive, reliable pho recipe that I can adapt to my family’s taste? I’ll have to try them all.

Meanwhile, would pho turn out differently if I use a different cooking method? Does it make sense that I have separate pho recipes written for the slow cooker, pressure cooker, and stovetop? I’d have to have strict content governance rules in place to keep this binder well-maintained.

Moving on, is the hardware used an important characteristic to describe the recipe? Yes, definitely. The equipment being used has a direct impact on how much time is spent cooking. For instance, I would search for slow-cooker recipes so that I can come home to a hot dinner in a pot.

Choosing the correct term to use in my faceted taxonomy is another consideration. Do I use the brand name as the preferred term or the generic reference? For instance, the recipe could be classified using the brand name “Crock Pot” or the generic term “slow cooker.” Which term would I most likely use or search for?
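Whichever term wins, the other variants can still lead to it. Here is a small sketch, assuming a hand-maintained table of variants that point to one preferred term; choosing "slow cooker" as the preferred term is an assumption for illustration, since that is the question your users should answer.

```python
# Hypothetical table: normalized variant -> preferred term.
PREFERRED = {
    "crock pot": "slow cooker",
    "crockpot": "slow cooker",
    "slow cooker": "slow cooker",
}


def preferred_term(term):
    """Normalize case, hyphens, and spacing, then look up the preferred term."""
    key = " ".join(term.lower().replace("-", " ").split())
    return PREFERRED.get(key, key)


print(preferred_term("Crock-Pot"))        # slow cooker
print(preferred_term("Pressure cooker"))  # pressure cooker
```

Terms missing from the table pass through in normalized form, so gaps in the vocabulary show up as untranslated tags instead of errors.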

Determining the facets used in a taxonomy is all about understanding the content and the user’s needs and workflow.

Do I have banh pho noodles? I’m not an expert in Vietnamese cuisine; I just like to eat pho. How could I learn more about these noodles and how to cook them? Those are the noodles that you specifically use for pho. If it’s the fresh ones, they should be stored in the fridge and used sooner than later. If they’re the dried ones, they should be in the pantry with the other dried noodle types, including somen, ramen, vermicelli, and Italian pasta. Expanding this taxonomy to include pictures and a description of general uses and nutritional benefits turns this taxonomy into a learning tool.

A Chinese household usually has oxtails and chicken feet around for making soups and broth. Sometimes I could end up with an overflow of oxtails sitting in my freezer due to nice deals at the supermarket. How can I find all the relevant recipes that use oxtails? Including ingredient as a facet also helps focus a general recipe search.
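An ingredient facet amounts to an index from each ingredient to the recipes that use it. The sketch below assumes recipes are stored as a title with a list of ingredients; the recipe titles and ingredient lists are made up for illustration.

```python
from collections import defaultdict

# Hypothetical recipes: title -> ingredients.
recipes = {
    "Oxtail soup": ["oxtail", "ginger", "star anise"],
    "Braised oxtail": ["oxtail", "soy sauce", "white sugar"],
    "Soy-braised chicken": ["chicken", "soy sauce", "white sugar"],
}


def build_ingredient_index(recipes):
    """Map each ingredient to the set of recipe titles that use it."""
    index = defaultdict(set)
    for title, ingredients in recipes.items():
        for ingredient in ingredients:
            index[ingredient.lower()].add(title)
    return index


index = build_ingredient_index(recipes)
print(sorted(index["oxtail"]))     # ['Braised oxtail', 'Oxtail soup']
print(sorted(index["soy sauce"]))  # ['Braised oxtail', 'Soy-braised chicken']
```

With the index in place, a freezer full of oxtails becomes a one-line lookup.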

What about time spent in preparing and cooking? I often need to retrieve recipes that take 30 minutes or less. Depending on which cooking method is used, the time spent can vary from 2 to 10 hours. It’s 3 hours to expected dinner time, and I need a recipe. This is an important enough characteristic for recipes that you often see “Quick” or “Easy” as popular facets used across cookbooks and recipe repositories.

When you start developing your taxonomy, you’ll want to start by brainstorming facets based on your content. See which ones are more inclusive and which are too exclusive. “Brand” may seem like a good facet, but it belongs in a product taxonomy, where you might need to capture all your Lee Kum Kee sauces in one place. For the average household kitchen, “brand” may be unnecessary.

Once you have your preliminary set of facets, evaluate them against your users: Will all of your users use all of the facets, or are there natural subsets?

My in-laws often bring tea boxes back as gifts, and we supplement our tea selection with other tea types, including green, black, and herbal teas like chamomile and chrysanthemum. My mother-in-law knows which teas are more well-known and satisfying to bring out for gongfu tea when we have company. For my users, the origin of a tea is a natural start of the conversation.

Dinner’s ready!

In this part of the series, I’ve shown the beginnings of a taxonomy that could unify both the recipe content as well as the spices and pantry content. It would streamline and optimize the daily task of meal planning and preparation by enabling different ways to search and retrieve recipes.

A taxonomy isn’t simply a controlled list of spices and pantry items. A taxonomy may be extended to the organization of a recipe binder (navigation), the recipe format in Evernote (metadata), and the cookbook library. Making use of taxonomy in these various ways, I’m able to optimize the search and retrieval language being used and bridge those content silos.

In preparation for my next post on card sorting, I have put together a card sorting exercise that shouldn’t take more than 15 minutes, unless you enjoy exploring ways to organize chaos like I do. In that post, I will walk through the entire process of a card sort, from planning to analysis, and show how the exercise can be used to gather user research and inform your taxonomy. Your responses will inform that analysis.

The study is open until February 7, 2016, and I’m offering 1 lb. of assorted chocolates from See’s Candies to two lucky people.

The post The Many Facets of Taxonomy appeared first on Boxes and Arrows.

Planning a Taxonomy Project

Grace G Lau — Tue, 20 Oct 2015 08:00:22 +0000

This is part 2 of “Taxonomy of Spices and Pantries,” in which I will be exploring the what, why, and how of taxonomy planning, design, and implementation:

Building the business case for taxonomy
Planning a taxonomy
The many facets of taxonomy
Card sorting
Tree testing
Taxonomy governance
Best practices of enterprise taxonomies

In part 1, I enumerated the business reasons for a taxonomy focus in a site redesign and gave a fun way to explain taxonomy. The kitchen isn’t going to organize itself, so the analogy continues.

I’ve moved every couple of years and it shows in the kitchen. Half-used containers of ground pepper. Scattered bags of star anise. Multiple bags of ground and whole cumin. After a while, people are quick to stuff things into the nearest crammable crevice (until we move again and the IA is called upon to organize the kitchen).

Planning a taxonomy covers the same questions as planning any UX project. Understanding the users and their tasks and needs is a foundation for all things UX. This article will go through the questions you should consider when planning a kitchen, er, um…, a taxonomy project.

Rumination of stuff in my kitchen and the kinds of users and stakeholders the taxonomy needs to be mindful of. Source: Grace Lau.

As with designing any software, application, or website, you’ll need to meet with the stakeholders and ask questions:

Purpose: Why? What will the taxonomy be used for?
Users: Who’s using this taxonomy? Who will it affect?
Content: What will be covered by this taxonomy?
Scope: What’s the topic area and limits?
Resources: What are the project resources and constraints?

(Thanks to Heather Hedden, “The Accidental Taxonomist,” p.292)

What’s your primary purpose?

Why are you doing this?

Are you moving, or planning to move? Is your kitchen so disorganized that you can’t find the sugar you need for soy-braised chicken? Is your content misplaced and hard to search?

How often have you found just plain old salt in a different spot? How many kinds of salt do you have anyway–Kosher salt, sea salt, iodized salt, Hawaiian pink salt? (Why do you have so many different kinds anyway? One of my favorite recipe books recommended using red Hawaiian sea salt for kalua pig. Of course, I got it.)

You might be using the taxonomy for tagging or, in librarian terms, indexing or cataloging. Maybe it’s for information search and retrieval. Are you building a faceted search results page? Perhaps this taxonomy is being used for organizing the site content and guiding the end users through the site navigation.

Establishing a taxonomy as a common language also helps build consensus and creates smarter conversations. On making baozi (steamed buns), I overheard a conversation between fathers:

Father-in-law: We need 酵母 [Jiàomǔ] {noun}.
Dad: Yi-see? (Cantonese transliteration of yeast)
Father-in-law: (confused look)
Dad: Baking pow-daa? (Cantonese transliteration of baking powder)

Meanwhile, I look up the Chinese translation of “yeast” in Google Translate while mother-in-law opens her go-to Chinese dictionary tool. I discover that the dictionary word for “yeast” is 发酵粉 [fājiàofěn] {noun}.

Father-in-law: Ah, so it rises flour: 发面的 [fāmiànde] {verb}

This discovery prompts more discussion about what it does and how it is used. There were at least 15 more minutes of discussing yeast in five different ways before the fathers agreed that they were talking about the same ingredient and its purpose. Eventually, we have this result in our bellies.

Homemade steamed baozi. Apparently, they’re still investigating how much yeast is required for the amount of flour they used. Source: Grace Lau.

Who are the users?

Are they internal? Content creators or editors, working in the CMS?

Are they external users? What’s their range of experience in the domain? Are we speaking with homemakers and amateur cooks or seasoned cooks with many years at various Chinese restaurants?

Looking at the users of my kitchen, I identified the following stakeholders:

Content creators: the people who do the shopping and have to put away the stuff
People who are always in the kitchen: my in-laws
People who are sometimes in the kitchen: me
Visiting users: my parents and friends who often come over for a BBQ/grill party
The cleanup crew: my husband who can’t stand the mess we all make

How do I create a taxonomy for them? First, I attempt to understand their mental models by watching them work in their natural environment and observing their everyday hacks as they complete their tasks. Having empathy for users’ end game—making food for the people they care for—makes a difference in developing the style, consistency, and breadth and depth of the taxonomy.

What content will be covered by the taxonomy?

In my kitchen, we’ll be covering sugars, salts, spices, and staples used for cooking, baking, braising, grilling, smoking, steaming, simmering, and frying.

How did I determine that?

Terminology from existing content. I opened up every cabinet and door in my kitchen and made an inventory.
Search logs. How were users accessing my kitchen? Why? How were users referring to things? What were they looking for?
Storytelling with users. How did you make this? People like to share recipes and I like to watch friends cook. Doing user interviews has never been more fun!

What’s the scope?

Scope can easily get out of hand. Notice that I have not included in my discussion any cookbooks, kitchen hardware and appliances, pots and pans, or anything that’s in the refrigerator or freezer.

You may need a scope document early on to plan releases (if you need them). Perhaps for the first release, I’ll just deal with the frequent use items. Then I’ll move on to occasional use items (soups and desserts).

If the taxonomy you’re developing is faceted—for example, allowing your users to browse your cupboards by particular attributes such as taste, canned vs dried, or weight—your scope should include only those attributes relevant to the search process. For instance, no one really searches for canned goods in my kitchen, so that’s out of scope.
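As a minimal sketch of what scoping facets can look like in practice, the snippet below filters a pantry inventory by attribute and rejects any facet that was left out of scope. The inventory, the attribute names, and the `browse` function are all hypothetical and only illustrate the idea.

```python
# Hypothetical pantry inventory; every item and attribute value is illustrative.
PANTRY = [
    {"name": "star anise", "taste": "sweet", "form": "dried", "use": "frequent"},
    {"name": "soy sauce", "taste": "salty", "form": "bottled", "use": "frequent"},
    {"name": "snow fungus", "taste": "mild", "form": "dried", "use": "occasional"},
]

# Only facets that people actually search by are in scope.
# "form" (canned vs dried) is deliberately left out.
IN_SCOPE_FACETS = {"taste", "use"}


def browse(items, **filters):
    """Return the names of items matching every requested in-scope facet."""
    out_of_scope = set(filters) - IN_SCOPE_FACETS
    if out_of_scope:
        raise ValueError(f"out-of-scope facets: {sorted(out_of_scope)}")
    return [
        item["name"]
        for item in items
        if all(item.get(facet) == value for facet, value in filters.items())
    ]


print(browse(PANTRY, use="frequent"))
```

Keeping the in-scope facets in one explicit set makes the scope decision visible: adding a facet later is a one-line change that can be tied to a release.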

What resources do you have available?

My kitchen taxonomy will be limited. Stakeholders are multilingual so items will need labelling in English, Simplified Chinese, and pinyin romanization. I had considered building a Drupal site to manage an inventory, but I have neither the funding nor the time to implement such a complex site.

At the same time, what are users’ expectations for the taxonomy? Considering the context of the taxonomy’s usage is important. How will (or should) a taxonomy empower its users? A good taxonomy is invisible: it shouldn’t disrupt their current workflow, only make it more efficient. Both fathers and my mom are unlikely to stop and use any digital technology to find and look things up.

Most importantly, the completed taxonomy and actual content migration should not conflict with the preparation of the next meal. My baby needs a packed lunch for school, and it’s 6 a.m. when I’m preparing it. There’s no time to rush around looking for things. Time is limited and a complete displacement of spices and condiments would disrupt the high-traffic flow in any household. Meanwhile, we’re out of soy sauce again and I’d rather it not be stashed in yet a new home and forgotten. That’s why we ended up with three open bottles of soy sauce from different brands.

What else should you consider for the taxonomy?

Understanding the scope of the taxonomy you’re building can help prevent scope creep in a taxonomy project. In time, you’ll realize that 80% of your time and effort is devoted to research, while only 20% goes to actually developing the taxonomy. So it is important to make time in your plan for iterations and for validation through card sorting and other testing.

In my next article, I will explore the many uses of taxonomy outside of tagging.


The post Planning a Taxonomy Project appeared first on Boxes and Arrows.

Building the Business Case for Taxonomy

Grace G Lau — Tue, 01 Sep 2015 08:00:14 +0000

How often have you found yourself on an ill-defined site redesign project? You know, the ones that you end up redesigning and restructuring every few years as you add new content. Or perhaps you spin up a new microsite because the new product/solution doesn’t fit in with the current structure, not because you want to create a new experience around it. Maybe your site has vaguely labelled navigation buckets like “More Magic”—which is essentially your junk drawer, your “everything else.”

Your top concerns on such projects are:

You can’t find anything.
Your users can’t find anything.
The navigation isn’t consistent.
You have too much content.

Your hopeful answer to everything is to rely on an external search engine, not the one that’s on your site. Google will find everything for you.

A typical site redesign project might include refreshing the visual design, considering the best interaction practices, and conducting usability testing. But what’s missing? Creating the taxonomy.

“Taxonomy is just tagging, right? Sharepoint/AEM has it—we’re covered!”

In the coming months, I will be exploring the what, why, and how of taxonomy planning, design, and implementation:

Building the business case for taxonomy
Planning a taxonomy
The many facets of taxonomy
Card sorting
Tree testing
Taxonomy governance
Best practices of enterprise taxonomies

Are you ready?

ROI of taxonomy

Although the word “taxonomy” is often used interchangeably with tagging, building an enterprise taxonomy means more than tagging content. It’s essentially a knowledge organization system, and its purpose is to enable the user to browse, find, and discover content.

Spending the time on building that taxonomy empowers your site to

better manage your content at scale,
allow for meaningful navigation,
expose long-tail content,
reuse content assets,
bridge across subjects, and
provide more efficient product/brand alignment.

In addition, a sound taxonomy in the long run will improve your content’s findability, support social sharing, and improve your site’s search engine optimization. (Thanks to Mike Atherton’s “Modeling Structured Content” workshop, presented at IA Summit 2013, for outlining the benefits.)

How do you explain taxonomy to get stakeholders on board? No worries, we won’t be going back to high school biology.

Explaining taxonomy

Imagine a household kitchen. How would you organize the spices?

Consider the cooks: In-laws from northern China, mom from Hong Kong, and American-born Grace. I’ve moved four times in the past five years. My husband, son, and I live with my in-laws. I have a mother who still comes over to make her Cantonese herbal soups.

We all speak different languages: English, Mandarin Chinese, and Cantonese Chinese.

I have the unique need of organizing my kitchen for multiple users. My in-laws need to be able to find their star anise, peppercorn, tree ear mushrooms, and sesame oil. My mom needs a space to store her dried figs, dried shiitake mushrooms, dried goji berries, and snow fungus. I need to find a space for dried thyme and rosemary for the “American” food I try to make. Oh, and we all need a consistent place for salt and sugar.

People can organize their kitchen by activity zones: baking, canning, preparing, and cooking. Other ways to organize a kitchen successfully could include:

attributes (shelf-life, weight, temperature requirements)
usage (frequency, type of use)
seasonality (organic, what’s in season, local)
occasion (hot pot dinners, BBQ parties)

You can also consider organizing by audience, such as for the five-year-old helper. I keep refining how the kitchen is organized each time we move. I have used sticky notes in Chinese and English with my in-laws and my mom as part of a card sorting exercise; I’ve tested the navigation around the kitchen to validate the results.

Early attempts at organizing my pantry.

If this is to be a data-driven taxonomy, I could consider attaching RFID tags to each spice container to track frequency and type of usage for a period of time to obtain some kitchen analytics. On the other hand, I could try guesstimating frequency by looking at the amount of grime or dust collected on the container. How often are we using chicken bouillon and to make what dishes? Does it need to be within easy reach of the stovetop or can it be relegated to a pantry closet three feet away?

From Home Depot.

Understanding the users and their tasks and needs is a foundation for all things UX. Taxonomy building is not any different. How people think about and use their kitchen brings with it a certain closeness that makes taxonomy concepts easier to grasp.

Who are the users? What are they trying to do? How do they currently tackle this problem? What works and what doesn’t? Watch, observe, and listen to their experience.

Helping the business understand the underlying concepts is one of the challenges I’ve faced with developing a solid taxonomy. We’re not just talking about tagging but breaking down the content by its attributes and metadata as well as by its potential usage and relation to other content. The biggest challenge is building the consensus and understanding around that taxonomy—taxonomy governance—and keeping the system you’ve designed well-seasoned!

Now, back to that site redesign project that you were thinking of: How about starting on that taxonomy? My next post will cover taxonomy planning.

The post Building the Business Case for Taxonomy appeared first on Boxes and Arrows.

Tree Testing

Dave OBrien — Sat, 05 Dec 2009 08:02:46 +0000

A big part of information architecture is organisation – creating the structure of a site. For most sites – particularly large ones – this means creating a hierarchical “tree” of topics.

But to date, the IA community hasn’t found an effective, simple technique (or tool) to test site structures. The most common method used — closed card sorting — is neither widespread nor particularly suited to this task.

Some years ago, Donna Spencer pioneered a simple paper-based technique to test trees of topics. Recent refinements to that method, some made possible by online experimentation, have now made “tree testing” more effective and agile.

How it all began

Some time ago, we were working on an information-architecture project for a large government client here in New Zealand. It was a classic IA situation – their current site’s structure (the hierarchical “tree” of topics) was a mess, they knew they had outgrown it, and they wanted to start fresh.

We jumped in and did some research, including card-sorting exercises with various user groups. We’ve always found card sorts (in person or online) to be a great way to generate ideas for a new IA.

Brainstorming sessions followed, and we worked with the client to come up with several possible new site trees. But were they better than the old one? And which new one was best? After a certain amount of debate, it became clear that debate wasn’t the way to decide. We needed some real data – data from users. And, like all projects, we needed it quickly.

What kind of data? At this early stage, we weren’t concerned with visual design or navigation methods; we just wanted to test organisation – specifically, findability and labeling. We wanted to know:
* Could users successfully find particular items in the tree?
* Could they find those items directly, without having to backtrack?
* Could they choose between topics quickly, without having to think too much (the Krug Test)¹?
* Overall, which parts of the tree worked well, and which fell down?

Not only did we want to test each proposed tree, we wanted to test them against each other, so we could pick the best ideas from each.

And finally, we needed to test the proposed trees against the existing tree. After all, we hadn’t just contracted to deliver a different IA – we had promised a better IA, and we needed a quantifiable way to prove it.

The problem

This, then, was our IA challenge:
* getting objective data on the relative effectiveness of several tree structures
* getting it done quickly, without having to build the actual site first.

As mentioned earlier, we had already used open card sorting to generate ideas for the new site structure. We had done in-person sorts (to get some of the “why” behind our users’ mental models) as well as online sorts (to get a larger sample from a wider range of users).

But while open card sorting is a good “detective” technique, it doesn’t yield the final site structure – it just provides clues and ideas. And it certainly doesn’t help in evaluating structures.

For that, information architects have traditionally turned to closed card sorting, where the user is provided with predefined category “buckets” and asked to sort a pile of content cards into those buckets. The thinking goes that if there is general agreement about which cards go in which buckets, then the buckets (the categories) should perform well in the delivered IA.

The problem here is that, while closed card sorting mimics how users may file a particular item of content (e.g. where they might store a new document in a document-management system), it doesn’t necessarily model how users find information in a site. They don’t start with a document — they start with a task, just as they do in a usability test.

What we wanted was a technique that more closely simulates how users browse sites when looking for something specific. Yes, closed card sorting was better than nothing, but it just didn’t feel like the right approach.

Other information architects have grappled with this same problem. We know some who wait until they are far enough along in the wireframing process that they can include some IA testing in the first rounds of usability testing. That piggybacking saves effort, but it also means that we don’t get to evaluate the IA until later in the design process, which means more risk.

We know others who have thrown together quick-and-dirty HTML with a proposed site structure and placeholder content. This lets them run early usability tests that focus on how easily participants can find various sublevels of the site. While that gets results sooner, it also means creating a throw-away set of pages and running an extra round of user testing.

With these needs in mind, we looked for a new technique – one that could:
* Test topic trees for effective organisation
* Provide a way to compare alternative trees
* Be set up and run with minimal time and effort
* Give clear results that could be acted on quickly

The technique — tree testing

Luckily, the technique we were looking for already existed. Even luckier was that we got to hear about it firsthand from its inventor, Donna Spencer, the well-regarded information architect out of Australia, and author of the recently released book “Card Sorting”:http://rosenfeldmedia.com/books/cardsorting/.

During an IA course that Donna was teaching, she was asked how she tested the site structures she created for clients. She mentioned closed card sorting, but like us, she wasn’t satisfied with it.

She then went on to describe a technique she called “card-based classification”:http://www.boxesandarrows.com/view/card_based_classification_evaluation, which she had used on some of her IA projects. Basically, it involved modeling the site structure on index cards, then giving participants a “find-it” task and asking them to navigate through the index cards until they found what they were looking for.

To test a shopping site, for example, she might give them a task like “Your 9-year-old son asks for a new belt with a cowboy buckle”. She would then show them an index card with the top-level categories of the site:

The participant would choose a topic from that card, leading to another index card with the subtopics under that topic.

The participant would continue choosing topics, moving down the tree, until they found their answer. If they didn’t find a topic that satisfied them, they could backtrack (go back up one or more levels). If they still couldn’t find what they were looking for, they could give up and move on to the next task.

During the task, the moderator would record:
* the path taken through the tree (using the reference numbers on the cards)
* whether the participant found the correct topic
* where the participant hesitated or backtracked
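As a rough sketch of how such a session could be captured in code, the snippet below models the tree as nested dictionaries and summarises one participant's moves. The tree contents, the `subtopics` and `summarise` names, and the rule that any move to a shallower card counts as a backtrack are assumptions made for illustration; they are not part of the paper method itself.

```python
# Hypothetical shopping-site tree: each key is a topic, each value its subtopics.
TREE = {
    "Clothing": {
        "Boys": {"Belts": {}, "Shirts": {}},
        "Girls": {"Belts": {}, "Dresses": {}},
    },
    "Toys": {"Outdoor": {}, "Board games": {}},
}


def subtopics(tree, path):
    """Return the topics listed on the 'index card' reached by following path."""
    node = tree
    for topic in path:
        node = node[topic]  # a KeyError means the path is not in the tree
    return list(node)


def summarise(moves, correct, gave_up=False):
    """Summarise one task: the path taken, success, and number of backtracks.

    moves is the sequence of cards the participant landed on, each written
    as a tuple of topics from the top of the tree.
    """
    backtracks = sum(
        1 for prev, cur in zip(moves, moves[1:]) if len(cur) < len(prev)
    )
    found = bool(moves) and not gave_up and moves[-1] == correct
    return {"path": moves, "found": found, "backtracks": backtracks}


# One participant looking for the cowboy-buckle belt.
moves = [
    ("Clothing",),
    ("Clothing", "Girls"),
    ("Clothing",),  # went back up one level
    ("Clothing", "Boys"),
    ("Clothing", "Boys", "Belts"),
]
print(summarise(moves, correct=("Clothing", "Boys", "Belts")))
```

Hesitation is not modelled here; on paper it is a moderator's observation, and recording it would need timestamps or notes alongside each move.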

By choosing a small number of representative tasks to try on participants, Donna found that she could quickly determine which parts of the tree performed well and which were letting the side down. And she could do this without building the site itself – all that was needed was a textual structure, some tasks, and a bunch of index cards.

Donna was careful to point out that this technique only tests the top-down organisation of a site and the labeling of its topics. It does not try to include other factors that affect findability, such as:
* the visual design and layout of the site
* other navigation routes (e.g. cross links)
* search

While it’s true that this technique does not measure everything that determines a site’s ease of browsing, that can also be a strength. By isolating the site structure – by removing other variables at this early stage of design – we can more clearly see how the tree itself performs, and revise until we have a solid structure. We can then move on in the design process with confidence. It’s like unit-testing a site’s organisation and labeling. Or as my colleague Sam Ng says, “Think of it as analytics for a website you haven’t built yet.”

So we built Treejack

As we started experimenting with “card-based classification” on paper, it became clear that, while the technique was simple, it was tedious to create the cards on paper, recruit participants, record the results manually, and enter the data into a spreadsheet for analysis. The steps were easy enough, but they were time eaters.

It didn’t take too much to imagine all this turned into a web app – both for the information architect running the study and the participant browsing the tree. Card sorting had gone online with good results, so why not card-based classification?

Ah yes, that was the other thing that needed work – the name. During the paper exercises, it got called “tree testing”, and because that seemed to stick with participants and clients, it stuck with us. And it sure is a lot easier to type.

To create a good web app, we knew we had to be absolutely clear about what it was supposed to do. For online tree testing, we aimed for something that was:
* Quick for an information architect to learn and get going on
* Simple for participants to do the test
* Able to handle a large sample of users
* Able to present clear results

We created a rudimentary application as a proof of concept, running a few client pilots to see how well tree testing worked online. After working with the results in Excel, it became very clear which parts of the trees were failing users, and how they were failing. The technique worked.

However, it also became obvious that a wall of spreadsheet data did not qualify as “clear results”. So when we sat down to design the next version of the tool – the version that information architects could use to run their own tree tests – reworking the results was our number-one priority.

Participating in an online tree test

So, what does online tree testing look like? Let’s look at what a participant sees.

Suppose we’ve emailed an invitation to a list of possible participants. (We recommend at least 30 to get reasonable results – more is good, especially if you have different types of users.) Clicking a link in that email takes them to the Treejack site, where they’re welcomed and instructed in what to do.

Once they start the test, they’ll see a task to perform. The tree is presented as a simple list of top-level topics:

They click down the tree one topic at a time. Each click shows them the next level of the tree:

Once they click to the end of a branch, they have 3 choices:
* Choose the current topic as their answer (“I’d find it here”).
* Go back up the tree and try a different path (by clicking a higher-level topic).
* Give up on this task and move to the next one (“Skip this task”).

Once they’ve finished all the tasks, they’re done – that’s it. For a typical test of 10 tasks on a medium-sized tree, most participants take 5-10 minutes. As a bonus, we’ve found that participants usually find tree tests less taxing than card sorts, so we get lower drop-out rates.

Creating a tree test

The heart of a tree test is…um…the tree, modeled as a list of text topics.

One lesson that we learned early was to build the tree based on the content of the site, not simply its page structure. Any implicit in-page content should be turned into explicit topics in the tree, so that participants can “see” and select those topics.

Also, because we want to measure the effectiveness of the site’s topic structure, we typically omit “helper” topics such as Search, Site Map, Help, and Contact Us. If we leave them in, it makes it too easy for users to choose them as alternatives to browsing the tree.
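If the tree is already held as structured data, stripping those helper topics can be automated. The sketch below assumes the tree is a nested dictionary of topic names; the `without_helpers` function and the sample site are hypothetical.

```python
# Helper topics named in the text; extend this set to suit your own site.
HELPER_TOPICS = {"Search", "Site Map", "Help", "Contact Us"}


def without_helpers(tree, helpers=HELPER_TOPICS):
    """Return a copy of a nested-dict tree with helper topics removed at every level."""
    return {
        topic: without_helpers(children, helpers)
        for topic, children in tree.items()
        if topic not in helpers
    }


# Hypothetical site structure, for illustration only.
site = {
    "Products": {"Widgets": {}, "Help": {}},
    "About us": {"Contact Us": {}},
    "Search": {},
}
print(without_helpers(site))
```

The function returns a new tree rather than editing the original, so the full structure stays available for later rounds of testing.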

Devising tasks

We test the tree by getting participants to look for specific things – to perform “find it” tasks. Just as in a usability test, a good task is clear, specific, and representative of the tasks that actual users will do on the real site.

How many tasks? You might think that more is better, but we’ve found a sizable learning effect in tree tests. After a participant has browsed through the tree several times looking for various items, they start to remember where things are, and that can skew later tasks. For that reason, we recommend about 10 tasks per test, presented in a random sequence.
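One simple way to present tasks in a random sequence is to shuffle them separately for each participant, which spreads any learning effect across tasks rather than piling it onto the last few. This is only a sketch: the `task_order` function, the participant IDs, and the idea of seeding by ID (so an order can be reproduced during analysis) are assumptions, not a description of how any particular tool works.

```python
import random


def task_order(tasks, participant_id):
    """Return the tasks in a random order that is reproducible per participant."""
    rng = random.Random(participant_id)  # seeded so the same ID gives the same order
    order = list(tasks)  # copy, so the master task list is left untouched
    rng.shuffle(order)
    return order


# Hypothetical task list of the recommended size (about 10).
tasks = [f"Task {n}" for n in range(1, 11)]
print(task_order(tasks, "participant-001"))
```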

Finally, for each task, we select the correct answers – 1 or more tree topics that satisfy that task.

The results

So we’ve run a tree test. How did the tree fare?

At a high level, we look at:
* Success – % of participants who found the correct answer. This is the single most important metric, and is weighted highest in the overall score.
* Speed – how fast participants clicked through the tree. In general, confident choices are made quickly (i.e. a high Speed score), while hesitation suggests that the topics are either not clear enough or not distinguishable enough.
* Directness – how directly participants made it to the answer. Ideally, they reach their destination without wandering or backtracking.
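To make these measures concrete, here is a minimal sketch of how they could be computed for one task from raw participant records. The data is invented, and the definitions are assumptions: directness is taken as the share of participants who never backtracked, and speed is reported as mean time rather than a score. The weighting used for the aggregate score is not described here, so the sketch does not attempt it.

```python
from statistics import mean

# Hypothetical results for one task: one record per participant.
results = [
    {"found": True, "backtracks": 0, "seconds": 12.0},
    {"found": True, "backtracks": 2, "seconds": 31.0},
    {"found": False, "backtracks": 1, "seconds": 40.0},
    {"found": True, "backtracks": 0, "seconds": 9.0},
]


def task_summary(results):
    """Return success %, directness %, and mean completion time for one task."""
    n = len(results)
    return {
        "success_pct": 100 * sum(r["found"] for r in results) / n,
        "direct_pct": 100 * sum(r["backtracks"] == 0 for r in results) / n,
        "mean_seconds": mean(r["seconds"] for r in results),
    }


print(task_summary(results))
```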

For each task, we see a percentage score on each of these measures, along with an aggregate score (out of 10):

If we see an overall score of 8/10 for the entire test, we’ve earned ourselves a beer. Often, though, we’ll find ourselves looking at a 5 or 6, and realise that there’s more work to be done.

The good news is that our miserable overall score of 5/10 is often some 8’s and 9’s brought down by a few 2’s and 3’s. This is where tree testing really shines — separating the good parts of the tree from the bad, so we can spend our time and effort fixing the latter.
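A small illustration of that pattern, using invented per-task scores and assuming (for simplicity only) that the overall score behaves like a plain average:

```python
from statistics import mean

# Hypothetical per-task scores out of 10.
task_scores = {
    "task 1": 9, "task 2": 8, "task 3": 9, "task 4": 8, "task 5": 8,
    "task 6": 3, "task 7": 2, "task 8": 3, "task 9": 2, "task 10": 3,
}


def weak_tasks(scores, threshold=5):
    """Return the tasks scoring below the threshold, worst first."""
    return sorted(
        (name for name, score in scores.items() if score < threshold),
        key=scores.get,
    )


overall = mean(task_scores.values())
print(overall)
print(weak_tasks(task_scores))
```

Listing the weak tasks worst first gives a ready-made order for deciding which parts of the tree to rework.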

To do more detailed analysis on the low scores, we can download the data as a spreadsheet, showing destinations for each task, first clicks, full click paths, and so on.

In general, we’ve found that tree-testing results are much easier to analyse than card-sorting results. The high-level results pinpoint where the problems are, and the detailed results usually make the reason plain. In cases where a result has us scratching our heads, we do a few in-person tree tests, prompting the participant to think aloud and asking them about the reasons behind their choices.

Lessons learned

We’ve run several tree tests now for large clients, and we’re very pleased with the technique. Along the way, we’ve learned a few things too:
* Test a few different alternatives. Because tree tests are quick to do, we can take several proposed structures and test them against each other. This is a quick way of resolving opinion-based debates over which is better. For the government web project we discussed earlier, one proposed structure had much lower success rates than the others, so we were able to discard it without regrets or doubts.

* Test new against old. Remember how we promised that government agency that we would deliver a better IA, not just a different one? Tree testing proved to be a great way to demonstrate this. In our baseline test, the original structure notched a 31% success rate. Using the same tasks, the new structure scored 67% – a solid quantitative improvement.

* Do iterations. Everyone talks about developing designs iteratively, but schedules and budgets often quash that ideal. Tree testing, on the other hand, has proved quick enough that we’ve been able to do two or three revision cycles for a given tree, using each set of results to progressively tweak and improve it.

* Identify critical areas to test, and tailor your tasks to exercise them. Normally we try to cover all parts of the tree with our tasks. If, however, there are certain sections that are especially critical, it’s a good idea to run more tasks that involve those sections. That can reveal subtleties that you may have missed with a “vanilla” test. For example, in another study we did, the client was considering renaming an important top-level section, but was worried that the new term (while more accurate) was less clear. Tree testing showed both terms to be equally effective, so the client was free to choose based on other criteria.

* Crack the toughest nuts with “live” testing. Online tree tests suffer from the same basic limitation as most other online studies – they give us loads of useful data, but not always the “why” behind it. Moderated testing (either in person or by remote session) can fill in this gap when it occurs.
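To put numbers on the new-against-old comparison above, the sketch below works through the two success rates quoted for the government project (31% and 67%). The `compare` function is hypothetical. Sample sizes are not given, so this shows only the size of the change, not its statistical significance.

```python
def compare(baseline_pct, new_pct):
    """Return the gain in percentage points and the ratio of new to baseline."""
    return {
        "point_gain": new_pct - baseline_pct,
        "ratio": new_pct / baseline_pct,
    }


result = compare(31, 67)
print(f"{result['point_gain']} points, {result['ratio']:.2f}x the baseline")
```

Reporting both figures helps: the point gain shows how many more participants out of 100 succeeded, while the ratio shows that the success rate roughly doubled.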

Conclusion

Tree testing has given us the IA method we were after – a quick, clear, quantitative way to test site structures. Like user testing, it shows us (and our clients) where we need to focus our efforts, and injects some user-based data into our IA design process. The simplicity of the technique lets us do variations and iterations until we get a really good result.

Tree testing also makes our clients happy. They quickly “get” the concept, the high-level results are easy for them to understand, and they love having data to show their management and to measure their progress against.

You can sign up for a free Treejack account at “Optimal Workshop”:http://www.optimalworkshop.com/treejack.htm.²

References

1. “Don’t Make Me Think”:http://www.amazon.com/Dont-Make-Me-Think-Usability/dp/0321344758, Steve Krug
2. Full disclosure: As noted in his “bio”:http://boxesandarrows.wpengine.com/person/35384-daveobrien, O’Brien works with Optimal Workshop.

The post Tree Testing appeared first on Boxes and Arrows.