The compression ratio can vary greatly depending on source data. The only way to determine the actual compression rate of source data is to empirically determine it by indexing it in Splunk. A standard figure of merit to use is 0.50 or a reduction of 50% in size.