The parse_json function interprets a string as JSON and returns the value as a dynamic object. Use this function to extract structured data from JSON-formatted log entries, API responses, or configuration values stored as JSON strings.

For users of other query languages

If you come from other query languages, this section explains how to adjust your existing queries to achieve the same results in APL.

In Splunk SPL, you use spath to parse JSON. APL's parse_json provides similar functionality with dynamic object support.

```sql Splunk example
| spath input=json_field
| rename "field.name" as extracted_value
```

```kusto APL equivalent
['sample-http-logs']
| extend parsed = parse_json(json_field)
| extend extracted_value = parsed['field']['name']
```

In ANSI SQL, JSON parsing functions vary by database vendor. APL's parse_json provides standardized JSON parsing across all datasets.

```sql SQL example
SELECT JSON_EXTRACT(json_field, '$.field.name') AS extracted_value
FROM logs;
```

```kusto APL equivalent
['sample-http-logs']
| extend parsed = parse_json(json_field)
| extend extracted_value = parsed.field.name
```

Usage

Syntax

parse_json(json_string)

Parameters

| Name        | Type   | Required | Description                              |
| ----------- | ------ | -------- | ---------------------------------------- |
| json_string | string | Yes      | A string containing valid JSON to parse. |

Returns

Returns a dynamic object representing the parsed JSON. If the JSON is invalid, returns the original string.
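As an illustrative sketch of this fallback behavior, you can parse a deliberately malformed string and inspect the result with gettype (described under related functions). The malformed literal here is purely for demonstration; the exact type name returned can be confirmed in your environment:

```kusto
['sample-http-logs']
| extend parsed = parse_json('{not valid json')
| extend result_type = gettype(parsed)
| project result_type
| limit 1
```

Because the input is not valid JSON, parsed holds the original string rather than a dynamic object, which gettype reflects.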

Use case examples

Parse JSON-formatted log messages to extract specific fields for analysis.

Query

```kusto
['sample-http-logs']
| extend json_data = parse_json('{"response_time": 145, "cache_hit": true, "endpoint": "/api/users"}')
| extend response_time = toint(json_data.response_time)
| extend cache_hit = tobool(json_data.cache_hit)
| extend endpoint = tostring(json_data.endpoint)
| project _time, response_time, cache_hit, endpoint, status
| limit 10
```

Run in Playground

Output

| _time                | response_time | cache_hit | endpoint   | status |
| -------------------- | ------------- | --------- | ---------- | ------ |
| 2024-11-06T10:00:00Z | 145           | true      | /api/users | 200    |
| 2024-11-06T10:01:00Z | 145           | true      | /api/users | 200    |

This query parses JSON-formatted metadata from logs to extract performance metrics like response time and cache hit status.

Extract structured attributes from JSON-formatted span data.

Query

```kusto
['otel-demo-traces']
| extend attrs = parse_json('{"http.method": "GET", "http.status_code": 200, "user.id": "12345"}')
| extend http_method = tostring(attrs['http.method'])
| extend http_status = toint(attrs['http.status_code'])
| extend user_id = tostring(attrs['user.id'])
| summarize span_count = count() by http_method, http_status
| sort by span_count desc
| limit 10
```

Run in Playground

Output

| http_method | http_status | span_count |
| ----------- | ----------- | ---------- |
| GET         | 200         | 8765       |

This query parses JSON attributes from OpenTelemetry spans to analyze HTTP request patterns.

Parse JSON-formatted security events to extract threat indicators.

Query

```kusto
['sample-http-logs']
| extend security_data = parse_json('{"threat_level": "high", "attack_type": "sql_injection", "blocked": true}')
| extend threat_level = tostring(security_data.threat_level)
| extend attack_type = tostring(security_data.attack_type)
| extend blocked = tobool(security_data.blocked)
| project _time, uri, threat_level, attack_type, blocked, id, ['geo.country']
| limit 10
```

Run in Playground

Output

| _time                | uri        | threat_level | attack_type   | blocked | id      | geo.country |
| -------------------- | ---------- | ------------ | ------------- | ------- | ------- | ----------- |
| 2024-11-06T10:00:00Z | /api/users | high         | sql_injection | true    | user123 | Unknown     |
| 2024-11-06T10:01:00Z | /admin     | high         | sql_injection | true    | user456 | Russia      |

This query parses JSON-formatted security events to extract and analyze threat indicators such as attack type and whether the request was blocked.

Best practices

When working with JSON data in Axiom, consider the following best practices:

  • Prefer structured ingestion over runtime parsing: If possible, structure your JSON data as separate fields during ingestion rather than storing it as a stringified JSON object. This provides better query performance and enables indexing on nested fields.
  • Use map fields for nested data: For nested or unpredictable JSON structures, consider using map fields instead of stringified JSON. Map fields allow you to query nested properties directly without using parse_json at query time.
  • Avoid mixed types: When logging JSON data, ensure consistent field types across events. Mixed types (for example, sometimes a string, sometimes a number) can cause query issues. Use type conversion functions like toint or tostring when necessary.
  • Performance considerations: Using parse_json at query time adds CPU overhead. For frequently queried JSON data, consider parsing during ingestion or using map fields for better performance.

List of related functions

  • parse_url: Parses URLs into components. Use this specifically for URL parsing rather than general JSON.
  • parse_csv: Parses CSV strings. Use this for comma-separated values rather than JSON.
  • todynamic: Alias for parse_json. Use either name based on your preference.
  • gettype: Returns the type of a value. Use this to check the types of parsed JSON fields.
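
To illustrate the mixed-type guidance above, the following sketch coerces a parsed field explicitly so downstream aggregation sees a consistent numeric type. The JSON literal is illustrative; in practice the field would come from your ingested data:

```kusto
['sample-http-logs']
| extend payload = parse_json('{"duration": "145"}')  // duration was logged as a string
| extend duration_ms = toint(payload.duration)        // coerce to a consistent numeric type
| summarize avg_duration = avg(duration_ms)
```

Without the toint coercion, events that log duration as a string would be excluded from or break the numeric aggregation.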
