From: "ZheNing Hu via GitGitGadget" <gitgitgadget@gmail.com>
To: git@vger.kernel.org
Cc: "Junio C Hamano" <gitster@pobox.com>,
    "Christian Couder" <christian.couder@gmail.com>,
    "Hariom Verma" <hariom18599@gmail.com>,
    "Bagas Sanjaya" <bagasdotme@gmail.com>,
    "Jeff King" <peff@peff.net>,
    "Ævar Arnfjörð Bjarmason" <avarab@gmail.com>,
    "Eric Sunshine" <sunshine@sunshineco.com>,
    "Philip Oakley" <philipoakley@iee.email>,
    "ZheNing Hu" <adlternative@gmail.com>
Subject: [PATCH 3/5] [GSOC] ref-filter: reuse final buffer
Date: Tue, 17 Aug 2021 07:14:47 +0000
Message-ID: <3760ff032bb1dec3812881fd408f8d78ec125477.1629184489.git.gitgitgadget@gmail.com>
In-Reply-To: <pull.1020.git.1629184489.gitgitgadget@gmail.com>

From: ZheNing Hu <adlternative@gmail.com>

In format_ref_array_item(), we add the object data to
ref_formatting_state and copy it from ref_formatting_state to
final_buf at the end. This copies a large amount of data.

Because final_buf is cleared before every call to
format_ref_array_item(), we are actually adding content to an empty
strbuf. We can add the object's data directly to final_buf instead
of first adding it to state.stack->output and then copying it to
final_buf.

Add a can_reuse_final_buffer flag to struct ref_format and create
can_reuse_final_buffer() to check whether the format uses %(align),
%(end), %(if), %(then) or %(else). If it does not, we can reuse the
buffer of final_buf.

In format_ref_array_item(), reuse the buffer of final_buf: add the
data directly to it and hand the contents back to final_buf at the
end.

This will bring performance improvements.

Mentored-by: Christian Couder <christian.couder@gmail.com>
Mentored-by: Hariom Verma <hariom18599@gmail.com>
Signed-off-by: ZheNing Hu <adlternative@gmail.com>
---
 ref-filter.c | 39 ++++++++++++++++++++++++++++++++++-----
 ref-filter.h |  3 ++-
 2 files changed, 36 insertions(+), 6 deletions(-)

diff --git a/ref-filter.c b/ref-filter.c
index 76a31fb79b1..7106d4c1c4c 100644
--- a/ref-filter.c
+++ b/ref-filter.c
@@ -1023,6 +1023,19 @@ static int need_parse_buffer(enum atom_type atom_type) {
 	}
 }
 
+static int can_reuse_final_buffer(enum atom_type atom_type) {
+	switch (atom_type) {
+	case ATOM_ALIGN:
+	case ATOM_END:
+	case ATOM_IF:
+	case ATOM_THEN:
+	case ATOM_ELSE:
+		return 0;
+	default:
+		return 1;
+	}
+}
+
 /*
  * Make sure the format string is well formed, and parse out
  * the used atoms.
@@ -1054,6 +1067,7 @@ int verify_ref_format(struct ref_format *format)
 			format->can_skip_parse_buffer = 0;
 		if (reject_atom(used_atom[at].atom_type))
 			die(_("this command reject atom %%(%.*s)"), (int)(ep - sp - 2), sp + 2);
+		format->can_reuse_final_buffer = can_reuse_final_buffer(used_atom[at].atom_type);
 
 		if ((format->quote_style == QUOTE_PYTHON ||
 		     format->quote_style == QUOTE_SHELL ||
@@ -2627,7 +2641,14 @@ int format_ref_array_item(struct ref_array_item *info,
 	struct ref_formatting_state state = REF_FORMATTING_STATE_INIT;
 
 	state.quote_style = format->quote_style;
-	push_stack_element(&state.stack);
+	if (format->can_reuse_final_buffer) {
+		struct ref_formatting_stack *s = xmalloc(sizeof(struct ref_formatting_stack));
+		s->output = *final_buf;
+		s->prev = state.stack;
+		state.stack = s;
+	} else {
+		push_stack_element(&state.stack);
+	}
 
 	info->can_skip_parse_buffer = format->can_skip_parse_buffer;
 	cp = format->format;
@@ -2641,7 +2662,8 @@ int format_ref_array_item(struct ref_array_item *info,
 			append_literal(cp, e->beg - 2, &state);
 		if (get_ref_atom_value(info, e->at, &atomv, error_buf) ||
 		    atomv->handler(atomv, &state, error_buf)) {
-			pop_stack_element(&state.stack);
+			if (!format->can_reuse_final_buffer)
+				pop_stack_element(&state.stack);
 			return -1;
 		}
 		cp = e->end + 1;
@@ -2656,16 +2678,23 @@ int format_ref_array_item(struct ref_array_item *info,
 		struct atom_value resetv = ATOM_VALUE_INIT;
 		resetv.s = GIT_COLOR_RESET;
 		if (append_atom(&resetv, &state, error_buf)) {
-			pop_stack_element(&state.stack);
+			if (!format->can_reuse_final_buffer)
+				pop_stack_element(&state.stack);
 			return -1;
 		}
 	}
 	if (state.stack->prev) {
+		assert(!format->can_reuse_final_buffer);
 		pop_stack_element(&state.stack);
 		return strbuf_addf_ret(error_buf, -1, _("format: %%(end) atom missing"));
 	}
-	strbuf_addbuf(final_buf, &state.stack->output);
-	pop_stack_element(&state.stack);
+	if(format->can_reuse_final_buffer) {
+		*final_buf = state.stack->output;
+		free(state.stack);
+	} else {
+		strbuf_addbuf(final_buf, &state.stack->output);
+		pop_stack_element(&state.stack);
+	}
 	return 0;
 }
 
diff --git a/ref-filter.h b/ref-filter.h
index df54836a643..a62a14a2e43 100644
--- a/ref-filter.h
+++ b/ref-filter.h
@@ -92,10 +92,11 @@ struct ref_format {
 	int can_skip_parse_buffer;
 	/* Internal state to ref-filter */
 	int need_color_reset_at_eol;
+	int can_reuse_final_buffer;
 	struct list_head parsed_atom_head;
 };
 
-#define REF_FORMAT_INIT { .use_color = -1, .can_skip_parse_buffer = 1 }
+#define REF_FORMAT_INIT { .use_color = -1, .can_skip_parse_buffer = 1, .can_reuse_final_buffer = 1 }
 
 /* Macros for checking --merged and --no-merged options */
 #define _OPT_MERGED_NO_MERGED(option, filter, h) \
-- 
gitgitgadget
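
The buffer handoff the patch describes (copy the caller's strbuf struct into a working slot, append there, then assign the struct back) can be illustrated outside of git with a minimal sketch. Everything in it is an assumption made for illustration: struct buf, buf_add(), format_with_copy() and format_with_reuse() are hypothetical stand-ins, not git's strbuf API or the real format_ref_array_item().

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical stand-in for git's struct strbuf; not the real API. */
struct buf {
	char *data;	/* heap storage, or NULL while empty */
	size_t len;	/* bytes in use, excluding the trailing NUL */
	size_t cap;	/* bytes allocated */
};

/* Append a NUL-terminated string, growing the storage as needed. */
static void buf_add(struct buf *b, const char *s)
{
	size_t n;

	assert(b && s);
	n = strlen(s);
	if (b->len + n + 1 > b->cap) {
		size_t cap = (b->len + n + 1) * 2;
		char *p = realloc(b->data, cap);
		if (!p)
			abort();
		b->data = p;
		b->cap = cap;
	}
	memcpy(b->data + b->len, s, n + 1);	/* copies the NUL as well */
	b->len += n;
}

/* Old shape: build the result in a scratch buffer, then copy it out. */
static void format_with_copy(struct buf *final, const char *ref)
{
	struct buf scratch = { NULL, 0, 0 };

	buf_add(&scratch, "ref: ");
	buf_add(&scratch, ref);
	buf_add(final, scratch.data);	/* the extra copy being avoided */
	free(scratch.data);
}

/* New shape: borrow the caller's buffer by value and hand it back. */
static void format_with_reuse(struct buf *final, const char *ref)
{
	struct buf work = *final;	/* both structs now alias one allocation */

	buf_add(&work, "ref: ");
	buf_add(&work, ref);		/* may realloc, leaving final->data stale */
	*final = work;			/* write back before returning */
}
```

In this sketch the write-back sits on the only exit path. While the working copy is live, the caller's struct may point at memory that realloc() has already moved, so a function with early returns would need the same write-back on each of them.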

Thread overview: 6+ messages
2021-08-17  7:14 [PATCH 0/5] [GSOC] [RFC] ref-filter: performance optimization ZheNing Hu via GitGitGadget
2021-08-17  7:14 ` [PATCH 1/5] [GSOC] ref-filter: skip parse_object_buffer in some cases ZheNing Hu via GitGitGadget
2021-08-17  7:14 ` [PATCH 2/5] [GSOC] ref-filter: remove second parsing in format_ref_array_item ZheNing Hu via GitGitGadget
2021-08-17  7:14 ` ZheNing Hu via GitGitGadget [this message]
2021-08-17  7:14 ` [PATCH 4/5] [GSOC] ref-filter: reduce unnecessary object_info comparisons ZheNing Hu via GitGitGadget
2021-08-17  7:14 ` [PATCH 5/5] [GSOC]: ref-filter: instead CALLOC_ARRAY to ALLOC_ARRAY ZheNing Hu via GitGitGadget