WIP: MDEV-34041 Display additional information for materialized subqueries… #3241

Olernov · 2024-05-04T13:07:30Z

… in EXPLAIN/ANALYZE FORMAT=JSON

The Jira issue number for this PR is: MDEV-34041

Description

This is a WIP (work-in-progress) PR, that's why the test suite is not updated. The ANALYZE output format is open for discussion, so it makes sense to update existing tests after the output format is approved

Release Notes

TODO: What should the release notes say about this change?
Include any changed system variables, status variables or behaviour. Optionally list any https://mariadb.com/kb/ pages that need changing.

How can this PR be tested?

TODO: modify the automated test suite to verify that the PR causes MariaDB to behave as intended.
Consult the documentation on "Writing good test cases".

If the changes are not amenable to automated testing, please explain why not and carefully describe how to test manually.

Basing the PR against the correct MariaDB version

This is a new feature and the PR is based against the latest MariaDB development branch.
This is a bug fix and the PR is based against the earliest maintained branch in which the bug can be reproduced.

PR quality check

I checked the CODING_STANDARDS.md file and my PR conforms to this where appropriate.
For any trivial modifications to the PR, I am ok with the reviewer making the changes themselves.

… in EXPLAIN/ANALYZE FORMAT=JSON

spetrunia

Currently, there are two kinds of data structures:

EXPLAIN data structure
Tracker.

Explain_subq_materialization tries to be both. I know this is caused by
the fact that there's really no query plan.

I would still request to have the Explain class and the tracker class
explicitly. Even if the Explain class has only one member, the tracker.

The tracker class should follow other tracker classes:

data members are private.
execution code calls functions to report data
printing json output is done by local members.
constructor initalizes everything to some initial state which
can already be printed (currently it doesn't). This is to allow
SHOW ANALYZE and also the situation where subquery is never invoked.

sql/item_subselect.cc

mysql-test/main/subselect_mat_analyze_json.result

spetrunia · 2024-05-13T13:14:30Z

sql/item_subselect.cc

+  if (is_analyze)
+  {
+    writer->add_member("r_exec_strategy").add_str(
+          exec_strategy_str[exec_strategy]);


is "exec" redundant in r_exec_strategy, can we just use r_strategy ?

Looking at the enum of strategies, I don't see many that make sense:

{ "undefined", "complete_match", "partial_match", "partial_match_merge", "partial_match_scan", "impossible" };

undefined makes sense if the execution never reached the code that determines the strategy.
complete match means "only do index lookup", perhaps we should just print "index_lookup".
partial_match - this does not occur in practice, does it?
partial_match_scan and partial_match_merge are ok.

Agree with r_strategy. The enum of strategies indeed looks excessive, partial_match and impossible don't seem to be used anywhere. Do you think we can shrink subselect_hash_sj_engine::exec_strategy?

I would not touch that in this patch. I would use a switch(...) instead of exec_strategy_str.

spetrunia

I think it also makes sense to introduce r_loops in subquery_materialization (to be materialization)

Olernov · 2024-05-18T14:46:09Z

I think it also makes sense to introduce r_loops in subquery_materialization (to be materialization)

I don't understand what should be counted/tracked under this counter. Every lookup in the materialized subquery?

spetrunia · 2024-05-21T08:23:47Z

I think it also makes sense to introduce r_loops in subquery_materialization (to be materialization)

I don't understand what should be counted/tracked under this counter. Every lookup in the materialized subquery?

Yes. Sometimes, it is possible to infer this number, but in general case it is not available anywhere else. So it makes a lot of sense to print it.

spetrunia · 2024-06-05T13:26:06Z

Also, note that for non-NULL-aware materialization, it reports r_loops==r_index_lookup_loops.

    "materialization": {
      "r_strategy": "index_lookup",
      "r_loops": 10,
      "r_index_lookup_loops": 10,
      "query_block": {

spetrunia

Input so far... @Olernov please wait with addressing it, I will let know when done

spetrunia · 2024-06-04T10:35:46Z

sql/sql_explain.h

@@ -115,6 +116,12 @@ class Explain_node : public Sql_alloc
  */
  Expression_cache_tracker* cache_tracker;

+  /**
+    This tracker is not NULL if the node explains a materialized subquery.


This is not a tracker anymore.

spetrunia · 2024-06-04T10:37:46Z

sql/sql_explain.cc

@@ -2693,3 +2710,45 @@ void Explain_range_checked_fer::print_json(Json_writer *writer,
    writer->end_object();
  }
 }
+
+
+void Explain_subq_materialization::print_explain_json(Json_writer *writer,


If this function gets all the data from tracker, would it make sense to delegate all printing to the tracker?

spetrunia · 2024-06-05T12:55:53Z

sql/item_subselect.cc

  if (in_subs->left_expr_has_null())
  {
    /*
      The case when all values in left_expr are NULL is handled by
      Item_in_optimizer::val_int().
    */
+    if (tracker)
+      tracker->increment_table_scan_loops();


Note that table scan is NOT done if in_subs->is_top_level_item() evaluates to TRUE right below.

This will cause wrong number of table scans to be reported.

... and is this relevant at all ?
Why does subselect_uniquesubquery_engine::exec() access the "materialization_tracker"?

spetrunia

Also, I do not see any testcase with r_table_scan_loops. Please add one. (or remove r_table_scan_loops if it's dead code, that's what I suspect).

spetrunia · 2024-06-06T12:10:32Z

mysql-test/main/subselect_mat_analyze_json.result

+        "materialization": {
+          "r_strategy": "index_lookup",
+          "r_loops": 3,
+          "r_index_lookup_loops": 3,


We should not add "_loops" to all counters.
Please change to "r_index_lookups".

spetrunia · 2024-06-06T12:14:13Z

mysql-test/main/subselect_mat_analyze_json.result

+    "subqueries": [
+      {
+        "materialization": {
+          "r_strategy": "partial_match_merge",


This looks confusing: it is not clear that partial matches are only done when we can't do index lookup.
How about changing to

index_lookup;array merge for partial match

"r_partial_match_loops": 1,
Please change to r_partial_matches.

"r_partial_merge_key_sizes": ["2"],
please change to r_partial_match_array_sizes .

spetrunia · 2024-06-06T12:15:08Z

mysql-test/main/subselect_mat_analyze_json.result

+    "subqueries": [
+      {
+        "materialization": {
+          "r_strategy": "partial_match_scan",


Following the above suggestion, let's change partial_match_scan into

index_lookup;full scan for partial match

spetrunia · 2024-06-06T12:39:43Z

@Olernov , ok I think there are enough comments to get the next version of the patch.

MDEV-34041 Display additional information for materialized subqueries…

5729b3b

… in EXPLAIN/ANALYZE FORMAT=JSON

Olernov requested a review from spetrunia May 4, 2024 13:07

spetrunia requested changes May 13, 2024

View reviewed changes

spetrunia reviewed May 13, 2024

View reviewed changes

sql/item_subselect.cc Outdated Show resolved Hide resolved

spetrunia reviewed May 13, 2024

View reviewed changes

mysql-test/main/subselect_mat_analyze_json.result Outdated Show resolved Hide resolved

spetrunia reviewed May 13, 2024

View reviewed changes

MDEV-32041 Fix code review comments

8b9d336

MDEV-32041 Fix code review comments 2

5e20e26

spetrunia requested changes Jun 5, 2024

View reviewed changes

spetrunia requested changes Jun 6, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WIP: MDEV-34041 Display additional information for materialized subqueries… #3241

WIP: MDEV-34041 Display additional information for materialized subqueries… #3241

Olernov commented May 4, 2024 •

edited

spetrunia left a comment

spetrunia May 13, 2024

Olernov May 18, 2024

spetrunia May 21, 2024

spetrunia left a comment

Olernov commented May 18, 2024

spetrunia commented May 21, 2024

spetrunia commented Jun 5, 2024

spetrunia left a comment

spetrunia Jun 4, 2024

spetrunia Jun 4, 2024

spetrunia Jun 5, 2024

spetrunia Jun 5, 2024

spetrunia Jun 6, 2024

spetrunia left a comment

spetrunia Jun 6, 2024

spetrunia Jun 6, 2024

spetrunia Jun 6, 2024

spetrunia commented Jun 6, 2024

WIP: MDEV-34041 Display additional information for materialized subqueries… #3241

Are you sure you want to change the base?

WIP: MDEV-34041 Display additional information for materialized subqueries… #3241

Conversation

Olernov commented May 4, 2024 • edited

Description

Release Notes

How can this PR be tested?

Basing the PR against the correct MariaDB version

PR quality check

spetrunia left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

spetrunia left a comment

Choose a reason for hiding this comment

Olernov commented May 18, 2024

spetrunia commented May 21, 2024

spetrunia commented Jun 5, 2024

spetrunia left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

spetrunia left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

spetrunia commented Jun 6, 2024

Olernov commented May 4, 2024 •

edited