If you are new to AIIM, you might be wondering what AIIM means when we say "information," which we admittedly say a lot. My favorite explanation of information is from Steve Weissman, CIP, who told me that he simply refers to information as "stuff in a box." Information represents all the data you manage within your organization. Information means both structured and unstructured data.
Structured data has a fixed structure, hence the name and consists of columns and rows of data in a table, or spread across several or many linked tables. For example, data found in spreadsheets or Customer Relationship Management systems is typically structured.
Conversely, unstructured data comes in a variety of formats, like emails, documents, videos, audio, text messages, images, and more. Due to this variety, it's harder to collect, process, and analyze. Hilariously, our analyst friends at Deep Analysis often refer to unstructured data as "ugly data." Unstructured data is vital to organizational operations, but it's messy and undeniably harder to manage.
Unstructured data is typically stored in unstructured repositories or as unstructured data inside of structured systems. Unstructured data is typically much larger in volume than structured data. Some industry experts estimate that unstructured data makes up 80% of an organization's total data.
Importantly, unstructured data is the primary fuel for generative AI applications and because of this, it's been receiving more attention lately.
Ouch...but this question is legitimately important to answer. Ultimately, the difference between structured and unstructured data may have no bearing on business outcomes, but it's important to understand the differences between these two types of data during any sort of technology project involving data, particularly AI implementation.
Our Certified Information Professional Study Guide includes an interesting example of a migration involving unstructured data, such as a migration to Microsoft SharePoint. Any migration involving unstructured data, that is, individual files, is bound to run into issues migrating certain file formats. These issues include:
Intelligent information management solutions can help you structure the unstructured. For example, software can apply natural language processing techniques to transform free-format text within documents into core elements, terms, and characteristics. There are also available market solutions that can scan images, like identification cards or passports, and using optical character recognition translate data into a structured format. Take a look at AIIM's Buyers Guide to find a solution provider to help you solve your structured and unstructured data quandaries.