-
-
Notifications
You must be signed in to change notification settings - Fork 968
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Reduce allocations in string handling #9405
Conversation
Intriguing! My immediate thought - will making those string buffers |
Hihi, using OpenTTD as a benchmark application .. things I never expected people to do :D Nice! Question for you: I do not fully understand why that tool considers a stack variable "an allocation"; as that shouldn't be an allocation at all. At least, from what I see in the assembly code, these So I wonder if that tool sees something I am not seeing (very much possible :D), or if some compilers with some optimization settings move the Any insights would be appreciated :) PS: btw, not saying it is such a great idea to have such large variables on the stack, but that is not the subject of this PR :D |
Are there multiple drawing threads? If that is the case, then yes.
This made me scratch my head at first, but could remember some similar situation I once ran into.
|
constexpr int x = DRAW_STRING_BUFFER; Using |
574bcc5
to
cfd70da
Compare
@vituscze Hi, I did some more digging regarding this and in fact you are right. Thanks for pointing that out! I also checked the output from heaptrack again and the buffers are in fact not allocated on the heap. I simply misread the information presented by it. Part of getting used to a new tool, I guess. @LordAro I removed the commit making the buffers static, so no worries about thread safety there. The allocation reduction stays identical though, so the rest of the claims remain valid. |
Doesn't look that way to me. Have you tested with something that actually uses SCC_NEWGRF_STRINL or SCC_NEWGRF_PRINT_WORD_STRING_ID? No idea what uses it though but to me it looks like it should break with this patch as it will throw away any leftover symbols after encountering those string codes. What can probably be done though is postponing stack/vector allocation until it's actually required like how I did it in #7922 |
@ldpl Hmm, right. One could also get rid of the stack by rewriting the function so that it operates on a single formatting string and recursively calls itself for these formatting codes. This makes the code cleaner, reduces the allocations and remains backwards compatible. |
Will close this PR. The required refactoring won't have a lot to do with this. |
Motivation / Problem
I often use OpenTTD to get accustomed with new tools. This time I used heaptrack to analyse and locate allocations in the code.
There are some places in string handling that reallocate temporary char buffers per call. As OpenTTD has to format quite a few strings per frame, this accumulates quickly.
Description
edit: skip this part
Next is the
FormatString
function. Currently, this function uses a stack to keep track of the remaining string.This is not necessary as this stack can never reach 2 values. (Correct me if I am wrong.) Hence using a pointer suffices and prevents allocations altogether.
I used the first 30s of this savegame to analyse the differences. I am using the sdl(_opengl) drivers.
The overall temporary allocations are down by ~90% and FormatString now doesn't allocate at all.
ViewportDrawStrings
+ Callers is now responsible for 26% of total temporary allocation (48k), which was 67% (413k).Temp allocation hotspots after this PR are:
Layouter::GetCachedParagraphLayout
~ 25%ViewportSortParentSpritesSSE41
~ 34%Main allocation hotspot is
ViewportSortParentSpritesSSE41
with ~85% of all allocations.I am not sure how much this translates to FPS improvement. The 512 samples are not capable of smoothening out the frame time jitter on my dev system. Maybe a reviewer can give this a try.
Limitations
none
Checklist for review
Some things are not automated, and forgotten often. This list is a reminder for the reviewers.