i965: perf: minimize the chances to spread queries across batchbuffers

author Lionel Landwerlin <lionel.g.landwerlin@intel.com>

Thu, 22 Jun 2017 01:15:50 +0000 (02:15 +0100)

committer Andres Gomez <agomez@igalia.com>

Fri, 25 Aug 2017 13:03:36 +0000 (16:03 +0300)
author Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Thu, 22 Jun 2017 01:15:50 +0000 (02:15 +0100)
committer Andres Gomez <agomez@igalia.com>
Fri, 25 Aug 2017 13:03:36 +0000 (16:03 +0300)
diff --git a/src/mesa/drivers/dri/i965/brw_performance_query.c b/src/mesa/drivers/dri/i965/brw_performance_query.c

index 2f49efa..cbb2da8 100644 (file)
--- a/src/mesa/drivers/dri/i965/brw_performance_query.c
+++ b/src/mesa/drivers/dri/i965/brw_performance_query.c
@@ -1095,6 +1095,14 @@ brw_end_perf_query(struct gl_context *ctx,
                                     obj->oa.begin_report_id + 1);
        }
  
+      /* We flush the batchbuffer here to minimize the chances that MI_RPC
+       * delimiting commands end up in different batchbuffers. If that's the
+       * case, the measurement will include the time it takes for the kernel
+       * scheduler to load a new request into the hardware. This is manifested
+       * in tools like frameretrace by spikes in the "GPU Core Clocks"
+       * counter.
+       */
+      intel_batchbuffer_flush(brw);
        --brw->perfquery.n_active_oa_queries;
  
        /* NB: even though the query has now ended, it can't be accumulated
author	Lionel Landwerlin <lionel.g.landwerlin@intel.com>
	Thu, 22 Jun 2017 01:15:50 +0000 (02:15 +0100)
committer	Andres Gomez <agomez@igalia.com>
	Fri, 25 Aug 2017 13:03:36 +0000 (16:03 +0300)