This message originated from Cribl Community Slack.
Click here to view the original link.
Hello team! I’d like to understand the difference between sending only the _raw field to Lake, and sending _raw along with some extracted/parsed fields. I assume it will affect the size of the event, but more importantly, how does it affect the performance in Cribl Search?
Additionally, are there any recommendations for using JSON/Parquet data format?
Solved
Cribl Search Performance Impact When Sending Only _raw Field Versus Extracted Fields
Best answer by jinsea
If you send in _raw and then some parsed fields from that _raw you are going to be duplicating data.
Inside the cribl lake you can do things like
dataset="staging" AND source_name="syslog_source" | extend raw = parse_json(_raw)
in your KQL to more easily see the _raw data :smile:Enter your E-mail address. We'll send you an e-mail with instructions to reset your password.
