How to Index and Process JSON Data for Hassle-free Business Insights
If your IT department is generating a tsunami of JSON-based log and event data, ChaosSearch® JSON FLEX® can fast-track automatic, flexible indexing so you can draw custom insights from your valuable business data.
The challenge is keeping up with a steady stream of thousands, millions, or even billions of generated events. Analysts need an easy way to condense that volume into compact, searchable data that they can quickly query and visualize. For JSON data in particular, nested arrays and nested properties support very rich information layers, but converting those layers into searchable two-dimensional representations is not straightforward. There is always the risk of the JSON permutation explosion: just one nested JSON record could quickly balloon into a million rows, or a million columns, of indexed data to represent that information.
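To make the permutation explosion concrete, here is a minimal Python sketch that expands every array in a record into one row per combination of elements. The event and its field names are invented for illustration, and this is plain Python rather than anything ChaosSearch runs internally.

```python
from itertools import product
from math import prod

# Hypothetical event; the field names are invented for illustration.
event = {
    "host": "web-01",
    "requests": [{"path": "/a"}, {"path": "/b"}, {"path": "/c"}],
    "tags": ["prod", "eu", "canary", "blue"],
    "errors": [{"code": 500}, {"code": 503}],
}


def array_fields(record):
    """Return the names of top-level fields whose values are lists."""
    return [k for k, v in record.items() if isinstance(v, list)]


def fully_expanded_rows(record):
    """Expand every array at once: one row per combination of elements."""
    arrays = array_fields(record)
    scalars = {k: v for k, v in record.items() if k not in arrays}
    rows = []
    for combo in product(*(record[k] for k in arrays)):
        row = dict(scalars)           # scalar fields repeat on every row
        row.update(zip(arrays, combo))
        rows.append(row)
    return rows


rows = fully_expanded_rows(event)
row_count = len(rows)
# The row count is the product of the array lengths: 3 * 4 * 2.
predicted = prod(len(event[k]) for k in array_fields(event))
```

With arrays of only 3, 4, and 2 elements, a single event already becomes 24 rows. By the same arithmetic, three arrays of 100 elements each would produce 1,000,000 rows from one record.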
JSON FLEX® leverages the Chaos Index® technology to uniquely represent the dimensionality of JSON. With JSON FLEX, administrators have the tools to selectively control how JSON source data is indexed, creating highly optimized Chaos Index data for in-depth querying at scale and offering flexible ways to unlock value for end-user analysts.
JSON FLEX tackles JSON indexing and search challenges on two fronts:
- Flexible Chaos Index® choices that can process and efficiently store JSON structures in the patented ChaosSearch indexed data format
- Flexible Chaos Refinery® views with schema-on-read transformation features that empower analysts and end-users to specify, quickly search, and visualize their important business data
The patented ChaosSearch indexing and data analysis solution includes proven features for filtering, indexing, and creating compact Chaos Index data. The ChaosSearch core design champions born-in-the-cloud behaviors like scaling, high availability, and centralized processing that keep our services close to your cloud storage, and it follows one driving principle: always keep your data inside your cloud storage.
When JSON events are in the mix, ChaosSearch adds powerful JSON-processing features.
Options to selectively apply JSON array expansion rules
Avoid the JSON permutation explosion, not by leaving important data out of the source files, but by using the Chaos Index to flatten some arrays horizontally for storage, expand some vertically for filtering and aggregation, and store some as JSON string blobs when the data might be valuable for search and query results.
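The three array treatments described above can be sketched in a few lines of Python. This is an illustration of the concepts only: the function names and the `field.0`, `field.1` column naming are assumptions made for this example, not ChaosSearch settings or its stored format.

```python
import json


def expand_horizontal(record, field):
    """One row; each array element becomes its own column (field.0, field.1, ...)."""
    row = {k: v for k, v in record.items() if k != field}
    for i, item in enumerate(record[field]):
        row[f"{field}.{i}"] = item    # assumed column naming, for illustration
    return [row]


def expand_vertical(record, field):
    """One row per array element; the other fields repeat on every row."""
    base = {k: v for k, v in record.items() if k != field}
    return [{**base, field: item} for item in record[field]]


def as_string_blob(record, field):
    """One row; the array is kept whole as a JSON string."""
    row = dict(record)
    row[field] = json.dumps(record[field])
    return [row]


# Hypothetical record for illustration.
record = {"host": "web-01", "codes": [200, 404, 500]}

horizontal = expand_horizontal(record, "codes")  # 1 row, 3 extra columns
vertical = expand_vertical(record, "codes")      # 3 rows, 1 column for the array
blob = as_string_blob(record, "codes")           # 1 row, array stored as text
```

The horizontal form keeps the row count flat at the cost of more columns, the vertical form gives each element a row that is easy to filter and aggregate, and the string blob keeps the data available without expanding it at all.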
Rules to exclude (or include only) the target JSON file content
Keep source data intact to avoid costly re-pipelining to scale down content. Use Chaos Index rules to specify the arrays and properties you want to index within an object group so that you reduce the index storage footprint, maximize scan performance, and filter out unneeded data. If other analysts wish to evaluate different or excluded arrays from the same source files, they can create their own object group and indexed data with different inclusion rules.
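As a rough illustration of include and exclude rules, the Python sketch below selects which top-level fields of a record get indexed while leaving the source record untouched. The `apply_rules` helper and the field names are hypothetical and do not reflect the actual ChaosSearch rule syntax.

```python
def apply_rules(record, include=None, exclude=None):
    """Return a copy of the record with only the fields the rules permit.

    include: if given, only these top-level fields are kept (allowlist)
    exclude: these top-level fields are dropped
    """
    exclude = set(exclude or ())
    kept = {}
    for key, value in record.items():
        if include is not None and key not in include:
            continue
        if key in exclude:
            continue
        kept[key] = value
    return kept


# Hypothetical source event; it is never modified by the rules.
source = {
    "host": "web-01",
    "status": 500,
    "debug_trace": ["frame1", "frame2", "frame3"],
    "tags": ["prod", "eu"],
}

# Two analysts index the same source with different rules.
ops_index = apply_rules(source, exclude=["debug_trace"])
debug_index = apply_rules(source, include=["host", "debug_trace"])
```

Because the rules act on a copy, each analyst gets a different indexed subset and the source file needs no re-pipelining.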
Views with JSON transformations and schema-on-read features in the Refinery
Materialize columns from the content of JSON strings, make a JSON string field searchable with an Elastic nested query path, or use JSON Array Transformation, the ability to transform a horizontally indexed array into a virtual vertical array. JSON Array Transformation lets you take advantage of the storage benefits of horizontal expansion and the analysis features of vertical expansion, all as a schema-on-read view materialization. Different business analysts can create their own views with different transformation rules to mine the insights they want from the same indexed data.
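The schema-on-read idea can be sketched in Python as functions that reshape a stored row only when it is read. Everything here is an assumption for illustration: the `codes.0`, `codes.1` column layout, the helper names, and the sample row are invented, and real Refinery views are configured in ChaosSearch rather than written as code like this.

```python
import json
import re


def to_virtual_vertical(row, field):
    """Turn columns field.0, field.1, ... of one stored row into one row per element."""
    pattern = re.compile(rf"^{re.escape(field)}\.(\d+)$")
    base, items = {}, []
    for key, value in row.items():
        match = pattern.match(key)
        if match:
            items.append((int(match.group(1)), value))
        else:
            base[key] = value
    # Sort by the numeric position so element order is preserved.
    return [{**base, field: value} for _, value in sorted(items, key=lambda p: p[0])]


def materialize_from_json_string(row, source, key, new_column):
    """Parse a JSON string column and expose one of its keys as a new column."""
    parsed = json.loads(row[source])
    return {**row, new_column: parsed.get(key)}


# Hypothetical stored row: one array indexed horizontally, one JSON string blob.
stored = {
    "host": "web-01",
    "codes.0": 200,
    "codes.1": 404,
    "codes.2": 500,
    "detail": '{"region": "eu", "tier": "gold"}',
}

vertical_view = to_virtual_vertical(stored, "codes")
region_view = materialize_from_json_string(stored, "detail", "region", "detail_region")
```

Both functions return new rows and leave the stored row as it was, which mirrors how different analysts can define different views over the same indexed data.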
It's a lot to take in, but we're excited about the JSON FLEX capabilities and how they can help organizations with JSON files stacking up in their cloud storage. Don't let your valuable JSON log and event data go unused. Don't spend critical time and storage budget on log shipper duplication to work around the JSON permutation explosion. Try JSON FLEX.