Stefan Glienke

Members

Content Count
1498
Joined
October 18, 2018
Last visited
4 hours ago
Days Won
152

Everything posted by Stefan Glienke

Devin AI - Is it already happening?

Stefan Glienke replied to FreeDelphiPascal's topic in General Help
- July 29, 2024
- 30 replies
Remove uses with interface

Stefan Glienke replied to Berocoder's topic in I made this

This, and it is also a rather limited approach, given that the lifetime of all those UI elements is bound to the application lifetime and they are all singletons (meaning that you cannot have more than one instance of, say, ITestDlg).
- July 29, 2024
- 3 replies
Devin AI - Is it already happening?

Stefan Glienke replied to FreeDelphiPascal's topic in General Help

The last thing I know of is that Devin AI was fake (that info is from April 2024 when the entire developer community was all over the demo they showcased). Do you know more?
- July 24, 2024
- 30 replies
how to filter on source files using VTune?

Stefan Glienke replied to merijnb's topic in Delphi Third-Party

Changing anything in the map file will only remove the names; the functions being called will still be in the sampling result because VTune is a sampling profiler. Given that your performance problem obviously was not with these functions, they should not be a significant part of the result percentage-wise. If they are, they are part of the issue.
- July 8, 2024
- 4 replies
Unicode NBSP(u00A0) No-Break Space character for 64-bit

Stefan Glienke replied to sp0987's topic in RTL and Delphi Object Pascal

Can't be asked - I am using the sfw (safe for work) version 😇
- June 26, 2024
- 14 replies
Unicode NBSP(u00A0) No-Break Space character for 64-bit

Stefan Glienke replied to sp0987's topic in RTL and Delphi Object Pascal

FWIW the assembly implementation of that function is pointless given you can do it in a way more readable way - also performance cannot be a reason given the following call to mem_write causes a temporary heap allocation to convert your hex_code variable into an AnsiString. Quickly slapped together (anyone who wants to further optimize this - be my guest, i cba right now):

    type
      hex_code = array [1 .. 4] of AnsiChar;

    function Int2Hex(c: Word): hex_code;
    const
      HexChars: array[0..15] of AnsiChar = '0123456789ABCDEF';
    var
      i: NativeUInt;
    begin
      i := c;
      Result[4] := HexChars[i and $0F];
      i := i shr 4;
      Result[3] := HexChars[i and $0F];
      i := i shr 4;
      Result[2] := HexChars[i and $0F];
      i := i shr 4;
      Result[1] := HexChars[i and $0F];
    end;

and then instead of mem_write with AnsiString as the first argument use one that writes 4 bytes at once.
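The last suggestion, writing the four characters directly instead of converting them to an AnsiString first, might look roughly like the sketch below. The real signature of mem_write is not shown in the thread, so WriteHex and the TStream target are hypothetical names used only for illustration.

```pascal
program Int2HexDemo;
{$APPTYPE CONSOLE}

uses
  System.Classes;

type
  hex_code = array [1 .. 4] of AnsiChar;

function Int2Hex(c: Word): hex_code;
const
  HexChars: array[0..15] of AnsiChar = '0123456789ABCDEF';
var
  i: NativeUInt;
begin
  i := c;
  Result[4] := HexChars[i and $0F];
  i := i shr 4;
  Result[3] := HexChars[i and $0F];
  i := i shr 4;
  Result[2] := HexChars[i and $0F];
  i := i shr 4;
  Result[1] := HexChars[i and $0F];
end;

// Hypothetical stand-in for a mem_write overload that takes raw bytes:
// the four characters are copied as-is, no temporary string is created.
procedure WriteHex(Stream: TStream; c: Word);
var
  h: hex_code;
begin
  h := Int2Hex(c);
  Stream.WriteBuffer(h, SizeOf(h));
end;

var
  ms: TMemoryStream;
begin
  ms := TMemoryStream.Create;
  try
    WriteHex(ms, $00A0);
    Writeln(ms.Size); // 4 bytes were written
  finally
    ms.Free;
  end;
end.
```

Whether this is measurably faster depends on what mem_write does internally; the point is only that the fixed-size array can be written without a heap allocation.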
- June 26, 2024
- 14 replies
Unicode NBSP(u00A0) No-Break Space character for 64-bit

Stefan Glienke replied to sp0987's topic in RTL and Delphi Object Pascal

Your assembly code of Int2Hex for 64bit is wrong - c is passed in RCX.
- June 25, 2024
- 14 replies
Don't freak out! It's just a bug: Debugging with Delphi

Stefan Glienke replied to silvercoder79's topic in Tips / Blogs / Tutorials / Videos

Nitpick at 10:30 - strings are always empty when not explicitly initialized like all managed types.
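A small sketch of that point, assuming a plain console program: local variables of managed types (strings, dynamic arrays) are initialized by the compiler, while an unmanaged local such as an Integer is not.

```pascal
program ManagedInitDemo;
{$APPTYPE CONSOLE}

procedure Demo;
var
  s: string;           // managed type: starts out as ''
  a: TArray<Integer>;  // managed type: starts out as nil (length 0)
  n: Integer;          // unmanaged local: undefined until assigned
begin
  Assert(s = '');
  Assert(Length(a) = 0);
  n := 0;              // has to be assigned before it is read
  Writeln(Length(s) + Length(a) + n);
end;

begin
  Demo;
end.
```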
- June 24, 2024
- 2 replies
Addictive Software gone ?

Stefan Glienke replied to Baron Software's topic in VCL

Please see the first entry that Google provides when searching for "Addictive Software". When looking for spell-checking alternatives - there is also this thread:
- June 19, 2024
- 7 replies
Parallel.ForEach is really slow

Stefan Glienke replied to stephane's topic in OmniThreadLibrary

I can absolutely repro - all my 20 logical cores (i5-13600k) go to 100% for 10 seconds. Running it through SamplingProfiler right now to check.
Edit: Okay, either this has regressed at some point after the demo was originally built or it was overlooked that there is actually no real workload inside of the delegate and thus it just measures the huge overhead from RTL and interlocked operations. It's just spending a huge amount of time wrapping and unwrapping the integer from TOmniValue and sharing the counter across all threads causing a complete bus lock every time due to the usage of DSiInterlockedExchangeAdd64 (*).
(*) I wrote bus lock and this is not entirely true, so before anyone chimes in quoting the Intel manual about the fact that the lock prefix does not automatically cause a bus lock - you are correct. Here we have the case that we are sharing the one variable across all the cores we have so it has to travel back and forth the CPU caches to and from RAM.
This code as is would be a nice worst-case example for Primoz' book about what can potentially go wrong when doing parallel programming. However: keep in mind that we don't have any real workload which would most likely change the picture as the workload usually takes the majority of processing time and not the parallel foreach code.
P.S. Among the aforementioned things it looks like the OTL code (particularly TOmniValue) is suffering from some overhead caused by inlining. For example: because TOmniValue.AsInt64 as well as TOmniValue.TryCastToInt64 is inlined it causes the temporary Variant variable it needs for the otvVariant case to be spilled into the location where the inlining happens. But in our case we never have to even deal with a Variant hence all the extra code the compiler produces is just overhead. And because the getter for AsInt64 is used twice, the compiler repeats the inlined code twice and produces two Variant variables which it finalizes using System.FinalizeArray.
Also a lot of record initialization and finalization is happening which I assume (did not look closer) is being caused by TOmniValue - potentially also because of some temporary compiler generated variables. Here is the drilldown of the interesting pieces when running in SamplingProfiler:
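To illustrate the shared-counter problem in isolation, here is a minimal sketch. It deliberately uses the RTL's System.Threading (TParallel.For) and TInterlocked rather than OTL, so it is not the code from the demo; ItemCount, ChunkCount, SumShared and SumChunked are names made up for this example. The first variant performs a locked add on one shared variable in every iteration, the second accumulates locally and touches the shared variable once per chunk.

```pascal
program SharedCounterDemo;
{$APPTYPE CONSOLE}

uses
  System.SysUtils, System.SyncObjs, System.Threading;

const
  ItemCount = 1000000;
  ChunkCount = 64;
  ChunkSize = ItemCount div ChunkCount;

// Every iteration does a locked add on the same variable, so the
// cache line holding "total" is contended by all worker threads.
function SumShared: Int64;
var
  total: Int64;
begin
  total := 0;
  TParallel.For(1, ItemCount,
    procedure(i: Integer)
    begin
      TInterlocked.Add(total, i);
    end);
  Result := total;
end;

// Each chunk sums into a local variable and publishes the result
// with a single locked add, so contention happens once per chunk.
function SumChunked: Int64;
var
  total: Int64;
begin
  total := 0;
  TParallel.For(0, ChunkCount - 1,
    procedure(chunk: Integer)
    var
      i, first, last: Integer;
      local: Int64;
    begin
      first := chunk * ChunkSize + 1;
      if chunk = ChunkCount - 1 then
        last := ItemCount
      else
        last := first + ChunkSize - 1;
      local := 0;
      for i := first to last do
        Inc(local, i);
      TInterlocked.Add(total, local);
    end);
  Result := total;
end;

begin
  Writeln(SumShared = SumChunked); // both variants compute the same sum
end.
```

As noted in the post, with a real workload inside the loop body the difference between the two variants would most likely matter far less.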
- June 7, 2024
- 5 replies
rease ... at ReturnAddress

Stefan Glienke replied to stijnsanders's topic in General Help

No, you can even see the stackframe settings in System.pas because it explicitly specifies them at the very beginning. Furthermore, even without explicitly enabling them, the RTL is being compiled with $W-. It is a compiler implementation detail that it enables a stackframe for any function that uses ReturnAddress - you can double-check that for yourself by compiling the following code and looking at the disassembly for Foo:

    {$STACKFRAMES OFF}

    procedure Bar(p: Pointer);
    begin
    end;

    procedure Foo;
    begin
      Bar(ReturnAddress);
    end;

    begin
      Foo;
    end.

I know this because I was using ReturnAddress in my code and it behaved wrong in XE, where it was implemented explicitly as I linked in my post above, but without a stackframe this returned a wrong address. This is why I explicitly enable stackframes for the code that uses this function in XE - see https://bitbucket.org/sglienke/spring4d/src/2dbce92195d699d51fc99dd226c4698748ec8ef9/Source/Base/Spring.pas#lines-3474
- June 5, 2024
- 9 replies
rease ... at ReturnAddress

Stefan Glienke replied to stijnsanders's topic in General Help

XE2 - see https://bitbucket.org/sglienke/spring4d/src/2dbce92195d699d51fc99dd226c4698748ec8ef9/Source/Base/Spring.pas#lines-3140
Anyhow the ReturnAddress function would not help here because it would point at the location in System._AbstractError which is the function that calls AbstractErrorProc. This improved handler is a highly brittle hack that might work depending on your compiler settings. It does not force specific compiler settings for the u_dzAbstractHandler.pas unit, which also changes where the return address 2 calls up is to be found, and it also does not even work for x64. I also don't share Thomas' assessment that this change came in Tokyo but I rather suspect that his stackframe settings were different between his tries on different Delphi versions. The code that is responsible for the abstract error did not change between Delphi XE (the oldest version I can check right now) and 12. The best way would probably be to use Caller(2) from JclDebug which does proper stack walking to determine the return address.
- June 5, 2024
- 9 replies
Refactoring in Delphi

Stefan Glienke replied to Berocoder's topic in MMX Code Explorer

At least not if they promote that developer to PM
- June 3, 2024
- 6 replies
Delphi on Surface Pro with Qualcomm CPU?

Stefan Glienke replied to PeterPanettone's topic in Windows API

You might have misunderstood my sentence because obviously I was referring to any Delphi compiler that might target ARM - have you looked into the JIRA reports I linked?
- May 26, 2024
- 20 replies
What are the performance profilers for Delphi 12?

Stefan Glienke replied to dmitrybv's topic in General Help

I'll take that opportunity and talk about it for an entire session at the Delphi Day and the Delphi Summit (the schedule still lists me talking about spring4d but for once I pass on that topic) 😉
- May 24, 2024
- 17 replies
Delphi on Surface Pro with Qualcomm CPU?

Stefan Glienke replied to PeterPanettone's topic in Windows API

Given the current issues regarding optimization that all LLVM-based Delphi compilers (that is all but the two Windows ones) have, I am tempted to say that an x86 or x64 binary using the emulation layer might be faster than what a compiler that directly targets ARM would produce today. There are multiple reports about this and it boils down to "need to migrate to a newer LLVM version" which we have been told for years now - since the C++ Builder side was recently migrated to a recent LLVM version I hope that the Delphi side gets addressed now. https://quality.embarcadero.com/browse/RSP-9922 https://quality.embarcadero.com/browse/RSP-17724 https://quality.embarcadero.com/browse/RSP-25754 https://quality.embarcadero.com/browse/RSP-28006
- May 24, 2024
- 20 replies
FYI: Stumbled upon interesting ASM optimization trick LLVm can do (most likely others also)

Stefan Glienke replied to Tommi Prami's topic in Algorithms, Data Structures and Class Design

There are multiple considerations - I don't know what compiler version, target, and options he was using to conclude that it will use a lea rather than a shift - at least with -O3 it will use a shift for multiplications by 4 and 8 although lea would also be applicable - for mul by 2 most likely add is being emitted because that is just the smaller instruction. Another consideration is whether the value is needed further - for example, x * 7 is implemented as x * 8 - x - and here it cannot use a shift for the * 8 because it needs the original value of x to subtract, therefore it uses lea to store the result in another register and subtracts the original register from it. Regarding the LEA instruction - I just remembered that I also reported that it should utilize this instruction when doing pointer math - see https://quality.embarcadero.com/browse/RSP-34820
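The x * 7 case can be written out at source level as a small sketch. This only shows the arithmetic identity x * 7 = x * 8 - x; which instructions (lea, shift, imul) a particular compiler actually emits for it is not something this code demonstrates. Mul7 is a name made up for the example, and overflow checks are switched off so that unsigned wraparound behaves the same on both sides.

```pascal
program Mul7Demo;
{$APPTYPE CONSOLE}
{$OVERFLOWCHECKS OFF}
{$RANGECHECKS OFF}

// x * 7 rewritten as (x * 8) - x. The original x is still needed
// after the multiplication by 8, which is why an in-place shift of
// the source register is not enough.
function Mul7(x: Cardinal): Cardinal;
begin
  Result := (x shl 3) - x;
end;

procedure Check;
var
  x: Cardinal;
begin
  for x := 0 to 1000 do
    Assert(Mul7(x) = x * 7);
end;

begin
  Check;
  Writeln(Mul7(6)); // 42
end.
```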
- May 24, 2024
- 11 replies
FYI: Stumbled upon interesting ASM optimization trick LLVm can do (most likely others also)

Stefan Glienke replied to Tommi Prami's topic in Algorithms, Data Structures and Class Design

Interestingly the Delphi compiler has done some optimization for multiplication by a constant for quite a while, but it was weird and not very good - see https://quality.embarcadero.com/browse/RSP-38636. Unfortunately, when implementing this someone missed looking into the instruction timings of imul vs the replacements and so we got a degradation in some cases - see https://embt.atlassian.net/servicedesk/customer/portal/1/RSS-1011.
- May 24, 2024
- 11 replies
What do you think of "Local Global variables"

Stefan Glienke replied to Tommi Prami's topic in RTL and Delphi Object Pascal

I've seen this mentioned before yet I never found this to be true - at least not in recent (past decade) Delphi versions.
- May 21, 2024
- 35 replies
I'm on the Dark Side... no, really!

Stefan Glienke replied to eivindbakkestuen's topic in GExperts

It looks to be fixed, but the Grep Search window is too small - HighDPI with 175%:
- May 21, 2024
- 11 replies
What do you think of "Local Global variables"

Stefan Glienke replied to Tommi Prami's topic in RTL and Delphi Object Pascal

While I at times use local routines I avoid accessing outer scope local variables like the plague because it usually generates quite a huge and often unnecessary stack frame.
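As a sketch of the two styles being contrasted: the first function lets the local routine reach into the outer routine's variables, the second passes everything it needs as parameters. SumCaptured and SumExplicit are illustrative names; the code shows only the source-level difference, not the generated stack frames.

```pascal
program LocalRoutineDemo;
{$APPTYPE CONSOLE}

// The nested routine accesses "total" from the enclosing scope.
function SumCaptured(const Values: array of Integer): Integer;
var
  total, i: Integer;

  procedure Add(v: Integer);
  begin
    Inc(total, v); // outer scope local variable
  end;

begin
  total := 0;
  for i := Low(Values) to High(Values) do
    Add(Values[i]);
  Result := total;
end;

// The nested routine only works on what it is handed explicitly.
function SumExplicit(const Values: array of Integer): Integer;
var
  total, i: Integer;

  procedure Add(var acc: Integer; v: Integer);
  begin
    Inc(acc, v);
  end;

begin
  total := 0;
  for i := Low(Values) to High(Values) do
    Add(total, Values[i]);
  Result := total;
end;

begin
  Writeln(SumCaptured([1, 2, 3]), ' ', SumExplicit([1, 2, 3])); // 6 6
end.
```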
- May 16, 2024
- 35 replies
XX3_64Bits -hash really flies

Stefan Glienke replied to Tommi Prami's topic in Cross-platform

Spring4d has an optimized version of XXH32 that is being used for the hashtables (dictionary and co) - see https://bitbucket.org/sglienke/spring4d/src/2dbce92195d699d51fc99dd226c4698748ec8ef9/Source/Base/Spring.Hash.pas#lines-46
Since all other versions in that family have a larger width and the hashcode in hashtables is typically 32bit (larger would only be worth it once you have more than 2^30 items in it) I did not bother to implement the others as well (also XXH3 is a bit more complicated, XXH32 is quite simple actually).
Also since the architecture of Spring4d 2.0 is pluggable you can replace it with your own hash function if anyone wants to come up with a faster one (I tried for example one from mormot2) - see https://bitbucket.org/sglienke/spring4d/src/2dbce92195d699d51fc99dd226c4698748ec8ef9/Source/Base/Spring.Comparers.pas#lines-87
Keep in mind though that the use case in Spring4d is for hashtables - which means these hash functions typically don't hash gigabytes of data as in other use cases. Nevertheless, the faster the better. As for a Delphi wrapper for the original C++ implementation - see https://github.com/YWtheGod/XXHASH4Delphi
- May 16, 2024
- 2 replies
I'm on the Dark Side... no, really!

Stefan Glienke replied to eivindbakkestuen's topic in GExperts

In rev 4252 there is a bug with the procedure list window on high dpi (possibly others? I did not test), it grows every time it's being opened. Edit: Sorry, should have gone into this thread:
- May 15, 2024
- 11 replies
How to patch a constructor?

Stefan Glienke replied to pyscripter's topic in RTL and Delphi Object Pascal

You don't want to do that - try solving your issue in AfterConstruction or otherwise. Code generated for constructors is more than meets the eye - replicating that without mistakes can be quite the endeavor.
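A minimal sketch of the AfterConstruction route, assuming the extra work can simply run once the object is fully constructed. TBase and TTracked are hypothetical classes; TObject.AfterConstruction is the virtual method that the RTL calls after the outermost constructor has returned.

```pascal
program AfterConstructionDemo;
{$APPTYPE CONSOLE}

type
  TBase = class
  public
    constructor Create;
  end;

  TTracked = class(TBase)
  public
    Initialized: Boolean;
    procedure AfterConstruction; override;
  end;

constructor TBase.Create;
begin
  inherited Create;
end;

procedure TTracked.AfterConstruction;
begin
  inherited;
  // Runs after all constructors have finished, so no constructor
  // code has to be patched or replicated.
  Initialized := True;
end;

var
  obj: TTracked;
begin
  obj := TTracked.Create;
  try
    Writeln(obj.Initialized); // TRUE
  finally
    obj.Free;
  end;
end.
```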
Delphi and "Use only memory safe languages"

Stefan Glienke replied to Die Holländer's topic in General Help

And unless they are implemented as thick 2-tier clients directly accessing some shared database they probably talk to some backend code where the interop with other systems is implemented. But let me make a slight correction about the particular Delphi situation: I know some people successfully build mobile apps but the majority are Windows (and some Mac OS) desktop applications. I am also leaving out all the web stuff because Delphi does not play a significant role in that area (yes, I know about the various frameworks for doing web stuff with Delphi), and depending on what technology is used most of it is backend code. There has to be some reason why some recently very famous programming languages don't have some easy-to-use UI frameworks - either because they run almost everywhere including your toaster which makes it hard to provide some all-in-one solution (how often have UI frameworks tried that already?) or because it's code that does not require some rich UI.
- April 22, 2024
- 167 replies
