By Catherine Heath on Writing docs from August 10, 2018
Knowledge base categorization is an important part of Information Architecture. Your categories create order out of chaos and reduce complexity.
Many technical writers struggle with both categorizing and labelling documentation. They wonder how their users consume information, and what terms their users use for parts of your organization or product.
The answer is: it doesn’t matter that much. That’s because Information Architecture is a map that you give your users to help them navigate your documentation more purposely and efficiently.
The map is based on a set of assumptions most people hold about:
If you haven’t already, learn how to create Information Architecture for your knowledge base. Then read on to learn how to categorize your knowledge base.
The Cambridge Dictionary defines categorization as:
“To put people or things into groups with the same features.”
This gets us off to a good start, but doesn’t really tell us how to categorize our content, or what the ultimate purpose of categorizing might be.
Turning to Wikipedia, we now have an explanation that categorization means:
“Categorization is the process in which ideas and objects are recognized, differentiated, and understood. Categorization implies that objects are grouped into categories, usually for some specific purpose. Ideally, a category illuminates a relationship between the subjects and objects of knowledge. Categorization is fundamental in language, prediction, inference, decision making and in all kinds of environmental interaction. It is indicated that categorization plays a major role in computer programming.”
We can understand categorization by thinking about taxonomy – the practice and science of classifying things or concepts, and the principles that underlie them.
Categorization can be thought of as part of your taxonomy for the concepts within your complex information object – your knowledge base. Your categories are part of your taxonomy and they are limited to a specific knowledge domain, which forms the outer boundaries of your taxonomy.
Look at this taxonomy for British sharks:
The knowledge domain is “British sharks”. The creator has included all the sharks within this domain, and they are categorized according to their belonging in particular subcategories. The subcategories are defined by the presence of certain biological features.
In the context of taxonomy, categories are used as:
The categories in your taxonomy should be “mutually exclusive and unambiguous”, and include all possibilities (all of your content) when taken as a whole.
Your taxonomy should be:
Your categories are the zones in your Information Architecture map. They orient your user by telling them where they are. Categorization is how you organize your documentation into meaningful groups.
Category labels serve as “markers” to your users showing what “stop” they’re at, and suggesting where they could go next.
It means labelling your documentation categories with evocative terms. Calling your categories “One”, “Two” and “Three” for example is not useful at all. Call them things that mean something to your users.
For a typical ecommerce company, that might be:
These basic top-level categories are mostly defined by the stage in the purchase process, and are broad enough to include all your documentation.
You don’t need to know the minds of your users back to front to create a good taxonomy. It should ideally be similar to other “taxonomies” they have seen before for maximum efficiency.
Consistency is key, and as long as you have empathy you will be able to categorize effectively. Organisational objectives are important, but they are the least relevant factor when it comes to categorization.
Your categories reflect the relationship between your customers’ needs, and the knowledge domain of your site. The knowledge domain is likely to be your organisation’s work, or a product along with features and related concepts. Your users’ needs are whatever they have come to you for.
External knowledge bases often represent a particular product or service. This means you can organise your categories by features, or come up with specific areas of issues that your users typically encounter.
Not only will you have information about your product, but there might also be related concepts, functionality, and use cases.
If your knowledge base is for customer support software, your categories might be:
These are fairly familiar categories that orient your user right away. Processes don’t always sit within a single team. For example, you might want to have all documentation relating to “product support” in one place – regardless of whether a dedicated team exists.
If your knowledge base is internal, you are more likely to categorize it according to team domain. This is tempting but it’s usually best to focus first on user needs then work backwards. If your user base is particularly diverse, as in a large university, then you can provide the initial framework of understanding.
For a university admin department, your categories could be:
Their categories must encompass admin staff, academic staff, library staff, maintenance staff, current students, former students, and many more possible users. In this case it’s better to stick with organizational structure.
Ideally, you should label your categories with the internal processes you use in your company rather than basing it purely on internal teams. For example, “financial” is more descriptive and evocative than “accounts department”.
Ask your customers some questions to gather more information about how they interact with your product or service. Use your empathy, and combine it with any analytics you have to decide on the best categories for your knowledge.
Most knowledge bases have content grouped using level of “depth” or “granularity”. Granularity is size relative to the whole.
Imagine an atom – it is made up of different-sized particles, each of which can be broken down into smaller particles. These subatomic particles are:
Content at a similar “depth” level should be grouped together.
What if you don’t have a pre-existing system that you can base your categories on, such as biological diversity or subatomic particles? You can decide that you want to have five top-level categories and draw up the boundaries based on having five roughly equal-sized segments.
You can use formatting to distinguish between the different content tiers. Introductory content can be shorter and more visual, written in a more conversational tone. It might be written in a way that is aimed at the more general or beginner user less patient with your product.
The “depth level” of your categories must orient your user. This means your category names must be of an equal level of depth if they are on the same tier – top level, second level down, third level down, etc.
This equality of depth (or weight) builds consistency in your system, and contributes to a sense that there is an intelligent design.
Use your intuitive understanding of the emotional significance of words to help you with this process. The words for your top-level categories should be “symbolic” rather than “descriptive”. They are concepts rather than actions.
The meaning of words also depends on the context where you find them. For example, the word “community” is more significant in the context of the world as a whole, than in your local neighbourhood. Nevertheless the concepts are still related.
A piece of content is a single unit of documentation such as an article or a video. It can be thought of as a “stop” on your map.
Content that is conceptually related can be grouped together into categories. For example, content all related to accomplishing a certain task, such as managing a user account. Categories may be a whole “zone” in themselves or part of a larger zone.
Categories function as signposts for your user directing them towards particular types of content that have been placed together. How well this mechanism works depends on whether you have used terminology that is both evocative and familiar to your users.
Top-level categories tend to be labelled with short words or phrases that encompass a large body of documentation. They are broad and non-specific.
This means your top level categories should be broad and sweeping to encompass a large variety of content, such as ‘your account’ and ‘analytics’.
They should not be called things like ‘editor permission role’, ‘changing your contact details’, and ‘how to find your total unique visitors’.
Do you see the difference?
Your top-level categories should be broad enough areas that your customers will expect to see when they get to your knowledge base.
Take MailChimp as an example:
Stripe may hold the position for most-loved documentation. With the Stripe docs, they present their top-level categories as: Payments, Subscriptions, Connect and Radar, because these are what they identify as the most important areas for their users.
Wave has grouped its content based on its different purposes for the user: getting started, guides and reference.
They should reflect your product, but also mirror other knowledge bases out there. Using expected taxonomies capitalizes on your customers’ prior experience, and reduces the amount of effort it takes to learn a new system.
At the same time, you need to present your topics in a natural and logical way. Just because everyone else is doing it, doesn’t mean it’s the right way. Your top-level categories should also reflect how customers interact with your company, and what they would consider the most important topics for them to learn about. Empathy is absolutely key here.
Try to keep your total number of top-level categories to less than ten (five, if you can manage it). This is for simplicity.
Also, ensure they are similarly significant to warrant being included as a top-level category.
Once you’ve decided on your broad, top-level categories, these need to be divided into subcategories. Your subcategories are what separates each tier of your content from each other.
These are usually hidden at first to reduce the complexity of the information that users need to see. Users can click on each category to reveal its contents.
Github does it like this:
Their method is actually quite overwhelming because you can see too much all at once. Bitbucket works harder to keep some complexity hidden:
The more complex your product is, the more likely you are to need lots of subcategories. Bear in mind that not all categories will need to be divided up further and can be just fine on their own. Don’t aim for symmetry just for the sake of it.
In KnowledgeOwl, content pages can also be categories because sometimes a topic needs to be on the same level as a category, but cannot be divided any further.
As users explore deeper and deeper inside your knowledge base, some articles might explain very technical concepts, or cover brief but finely-grained topics.
If you have content buried deep in a subcategory but you think it’s important for some users to see it on the homepage, you can always surface it using quicklinks or frequently used articles.
PRO TIP: If you don’t need to have lots of sub-categories, you can also organise shorter or more granular pieces of content into broader “guides”. Guides cover a wider spectrum of concepts than you’d usually find on a single content page.
“Getting Started” guides are popular with Software as a Service companies, and developer-focused documentation.
Some information is written at a more advanced (or “deeper”) level for more experienced users, or more technical users. It can’t be broken down anymore, and yet it’s hard for this kind of content to “stand alone” – nor should you want it to.
A deep-level content page may link conceptually to other deep-level content contained in separate categories, so in this case you can hyperlink. The link could either be from a relevant term in your documentation, or in the references/footnotes of your documentation.
You can also link to external reference material or a glossary.
Gather as much input as possible from your team when categorizing your knowledge base. Other people may have a perspective that you don’t. They’ll know best what should be in there – at the same time, stand firm about your taxonomy.
Your customer support team will be of particular value in this exercise, since they know your customers and their perspectives.
Once you’ve decided on your categories, show your plan to your team for a final review to ensure it makes sense, and is likely to be useful.
Don’t be too proud to learn. Other people have done similar things before and done them well. Start paying attention to other knowledge bases and how they’re structured to build up a picture of what works well.
Here’s a list of good knowledge bases:
Remember that each knowledge base’s categories are different due to its specific purpose, unique context, and different customer base.
Don’t try to directly copy anyone else’s design, and instead take inspiration from other knowledge bases that you like in order to produce your own appropriate categorization.
If you fail to categorize your documentation well, your users will become very frustrated and think you don’t know them. Over-reliance on the search bar is like expecting your customers to do all the work for you. It also doesn’t help that much with exploration of your product.
Providing them with a conceptual framework – a map – for your knowledge base is empowering customers to take part in an active relationship with your product.
Your knowledge base categories are a crucial part of creating an exceptional support experience for your users. Ensure you use your terms and labels consistently.
Make sure you benchmark your categories against others in the industry and ask your team for their input before you make any final decisions.
Don’t forget to make use of diagrams throughout your process, and particularly the good old-fashioned pen and paper.
General posts useful to all documentarians about writing documentation, editing and publishing workflows, and more.
Your flight plan for how to get the most out of KnowledgeOwl features and integrate them into your workflows.
Major KnowledgeOwl company announcements.
Learn how others are using KnowledgeOwl & get pro tips on how to make the most of KO!
Find out more about who we are and what we value.
We believe good support is the foundation of good business. Learn about support tools and methodology.
Learn more about tools to solve various documentarian issues, within and beyond KnowledgeOwl.
Not sure what category you need? Browse all the posts on our blog.
Watch a 5-minute video and schedule time to speak with one of our owls.