[refactor] Move Type::ctype into an external AA by yebblies · Pull Request #4569 · dlang/dmd

yebblies · 2015-04-07T12:03:12Z

The idea is that backend-specific passes shouldn't need to add methods to the frontend ast classes. Using a hash scales much better, if the performance is acceptable.

yebblies · 2015-04-07T13:19:51Z

I can't see any performance difference on the autotester.

MartinNowak · 2015-04-10T20:06:34Z

It's a logical consequence to do this, though I wonder if other alternatives have been evaluated.
In the long-run we'd need a better AA implementation for this.

One alternative idea would be to have a backend header, declaring a backend specific payload, that gets embedded in the frontend types. This could remain an opaque type for the frontend.

yebblies · 2015-04-11T05:15:12Z

It's a logical consequence to do this, though I wonder if other alternatives have been evaluated.

Other alternatives?

In the long-run we'd need a better AA implementation for this.

I thought the compiler's AA was pretty fast.

One alternative idea would be to have a backend header, declaring a backend specific payload, that gets embedded in the frontend types. This could remain an opaque type for the frontend.

The ultimate goal is compiler-as-a-library, which cannot (reasonably) work like that. We need a way to add passes without any modification of the ast types.

MartinNowak · 2015-04-11T08:37:27Z

I thought the compiler's AA was pretty fast.

The string table is fast, the aav is better than a few years ago, but not great.

The ultimate goal is compiler-as-a-library, which cannot (reasonably) work like that.

Injecting them via template argument?
I'm just worried about the overall performance. Double dispatch already comes at quite some price, if we now start to turn field accesses into hash lookups, we'll add another source of uniform slowdown.

How often is the ctype looked up, e.g. when compiling Phobos.

yebblies · 2015-04-11T10:11:51Z

Injecting them via template argument?

I'd consider that unreasonable.

I'm just worried about the overall performance. Double dispatch already comes at quite some price, if we now start to turn field accesses into hash lookups, we'll add another source of uniform slowdown.

Yeah, that's true. Although it doesn't looks like it's big enough to worry about.

How often is the ctype looked up, e.g. when compiling Phobos.

Let's find out.

MartinNowak · 2015-04-14T09:38:51Z

It would help, if you can provide those numbers and also explain what other fields you want to move out of the AST.

ibuclaw · 2015-04-14T10:12:43Z

also explain what other fields you want to move out of the AST.

aggregate.h:

AggregateDeclaration
- Symbol *stag
- Symbol *sinit
ClassDeclaration
- Symbol *vtblsym

declaration.h:

FuncDeclaration
- Symbol *shidden

dsymbol.h:

Dsymbol
- Symbol *csym
- Symbol *isym

enum.h:

EnumDeclaration
- Symbol *sinit

expression.h:

StructLiteralExp
- Symbol *sinit
- Symbol *sym

module.h:

Module
- int doppelganger
- Symbol *cov
- unsigned *covb
- Symbol *sictor
- Symbol *sdtor
- Symbol *ssharedctor
- Symbol *sshareddtor
- Symbol *stest
- Symbol *sfilename
- Symbol *massert
- Symbol *munittest
- Symbol *marray

mtype.h:

Type
- type *ctype

statement.h:

LabelStatement
- block *lblock
- Blocks fwdrefs
DefaultStatement
- block *cblock (gdc only)
CaseStatement
- block *cblock

@yebblies - Did I miss any other transparent BE types?

yebblies · 2015-04-14T11:21:37Z

@ibuclaw The list from backend.d is

struct Symbol;
struct TYPE;
alias type = TYPE;
struct code;
struct block;

Would be nice for that list to be empty.

It would help, if you can provide those numbers and also explain what other fields you want to move out of the AST.

@MartinNowak Building phobos on win32 results in ~10^6 ctype lookups. I want to move all backend-specific types out of the ast. I might look at doing the same with other passes (eg ctfe) sometime in the future.

Safety0ff · 2015-04-15T18:17:59Z

Keeping backend fields in the AST also eagerly allocates memory for fields that are never used.
E.g. I moved csym and isym out of Dsymbol and it resulted in a ~1.8% overall memory reduction for building phobos with no performance impact.

MartinNowak · 2015-04-15T20:59:37Z

src/toctype.c

What happened here? Seems unrelated.

Just a minor refactoring.
In the old version: If the unqualified version of this type already has a ctype, we copy it and add the qualifiers. Otherwise we create a new Classsym/ctype.

In the new version, we generate a new ctype if this is the unqualified version, otherwise recurse to get the unqualified ctype then copy it.

As far as I can tell, the old code would generate a symbol with the wrong qualifiers when called with a qualified before being called on the unqualified symbol, because both would be generated and set to the same value but the 'add modifiers' code would never be reached.

WalterBright · 2015-04-18T01:48:27Z

The autotester isn't a great guide for performance testing, as it only deals with small programs.

A compiler slows down bit by bit, from barnacles accumulating on the hull one by one. Each one doesn't do much, but the accumulation does.

Perhaps use an opaque pointer instead?

yebblies · 2015-04-18T05:48:23Z

The autotester isn't a great guide for performance testing, as it only deals with small programs.

The win32 phobos build is fairly big.

A compiler slows down bit by bit, from barnacles accumulating on the hull one by one. Each one doesn't do much, but the accumulation does.

I know, but sometimes it's worth it.

Perhaps use an opaque pointer instead?

A pointer to what, though? Different passes need to store different data, and having a pointer for each is exactly the problem I'm hoping to avoid.

There were similar concerns about the minor but systemic slowdowns from switching to the double-dispatch visitor implementation, but I think that has payed off in making the code more maintainable. My goal it to make it as easy as possible to maintain glue layers/backends, and that means removing all backend-specific code and data from the frontend ast.

MartinNowak · 2015-04-19T19:02:33Z

There were similar concerns about the minor but systemic slowdowns from switching to the double-dispatch visitor implementation, but I think that has payed off in making the code more maintainable.

It was a requirement for ddmd, because D doesn't support implementing classes in multiple files.
The visitor might also have helped gdc/ldc, and we separated a few things in the compiler
better. That said, it still slows down the compiler and is slightly more difficult to maintain (try
to jump to CondExp::toIR or debug something).
Moving data to hashtables will also make things slightly slower and slightly harder to maintain.

A pointer to what, though? Different passes need to store different data, and having a pointer for each is exactly the problem I'm hoping to avoid.

You can statically declare "opaque" (as in ptr) backend types (in the glue layer) and embed them in the AST.
It's still possible to track additional data for optional passes in a hash table.
Using a hashtable by default is a pessimization for non-sparse data.

Can you please try to find out what LLVM et.al. do?

ibuclaw · 2015-04-20T09:01:16Z

A pointer to what, though? Different passes need to store different data, and having a pointer for each is exactly the problem I'm hoping to avoid.

I think Walter meant void*? If so, still not pleasant.

yebblies · 2015-04-20T10:49:55Z

It was a requirement for ddmd, because D doesn't support implementing classes in multiple files.

Yeah, sort of. There were other options but that was by far the best.

The visitor might also have helped gdc/ldc, and we separated a few things in the compiler
better. That said, it still slows down the compiler and is slightly more difficult to maintain (try
to jump to CondExp::toIR or debug something).

The fact that you no longer need to update headers when adding a pass more than makes up for any maintenance burden IMO.

Moving data to hashtables will also make things slightly slower and slightly harder to maintain.

There's no guarantee it makes it slower. When running e2ir, the ctype hash table is much more likely to be in cache than the Type classes themselves. (Not so useful in the current AA but I have a version with one less indirection that is much more cache-friendly.)
The pointer to the Type never needs to be dereferenced for the hash table access.

You can statically declare "opaque" (as in ptr) backend types (in the glue layer) and embed them in the AST.

ie the current system. Requires knowing which passes need cached data at compile time, potential wasted memory if they're not needed. Some backends require multiple values per class, and this either means lots of ugly or an extra indirection.

It's still possible to track additional data for optional passes in a hash table.

It's not really about optional passes, it's about not polluting the frontend code with backend-specific data members.

Using a hashtable by default is a pessimization for non-sparse data.

Not always. More cache friendly, less space allocated when not all classes are codegen'd, etc

Can you please try to find out what LLVM et.al. do?

I tried, didn't get very far.

Is this just the usual knee-jerk reaction to anything that might hurt performance? Are there any concrete goals that would make this acceptable? I honestly would accept a fairly big performance hit if it got us closer to having a 100% unified frontend.

yebblies · 2015-04-20T10:53:42Z

@ibuclaw Would you accept using this kind of approach in GDC? @klickverbot @redstar Would you use this in LDC? I assume you guys have access to fast hash tables from your backends' support libraries.

ibuclaw · 2015-04-20T11:05:10Z

@yebblies - I'm considering a future of GDC without any IRState, Symbol, or dt_t baggage from DMD. So far the design is looking much cleaner, but then again it is not complete either. ;-)

ibuclaw · 2015-04-20T11:06:00Z

I assume you guys have access to fast hash tables from your backends' support libraries.

Yes.

MartinNowak · 2015-04-20T15:32:19Z

less space allocated when not all classes are codegen'd

That's why I said non-sparse data ;), might indeed make sense for rare backend data.

Is this just the usual knee-jerk reaction to anything that might hurt performance? Are there any concrete goals that would make this acceptable? I honestly would accept a fairly big performance hit if it got us closer to having a 100% unified frontend.

The last release came with a 10-25% slowdown (Issue 14431).

When running e2ir, the ctype hash table is much more likely to be in cache than the Type classes themselves.

That's an interesting argument, but it doesn't quite work out for ctype. The slowdown is neglectable but measureable.
We cannot decide this on the basis of such a tiny change. How about we convert most if not all backend types and maybe tweak the AA first to have a meaningful comparison.
I'm generally in favor of this change.

ibuclaw · 2015-04-20T15:46:13Z

How about we convert most if not all backend types and maybe tweak the AA first to have a meaningful comparison.

+1 - I agree, let's convert all and benchmark.

yebblies · 2015-04-20T16:51:48Z

We cannot decide this on the basis of such a tiny change. How about we convert most if not all backend types and maybe tweak the AA first to have a meaningful comparison.
I'm generally in favor of this change.

Thanks, I can work with that.

redstar · 2015-04-20T20:04:38Z

Yes, let's try it.

redstar · 2015-04-21T05:01:09Z

Considering the performance impact: Why not using a factory class for AST nodes? In this case each backend could provide decorated AST nodes. The factory itself could be made configurable to support a library solution. The trade-off would be a cast to the new AST types if access to new members is required.

yebblies · 2015-04-21T08:11:05Z

Considering the performance impact: Why not using a factory class for AST nodes? In this case each backend could provide decorated AST nodes. The factory itself could be made configurable to support a library solution. The trade-off would be a cast to the new AST types if access to new members is required.

The big downside of that is that it is a very invasive change. It also only supports a single backend at a time.

ibuclaw · 2015-04-21T09:11:10Z

It also only supports a single backend at a time.

Why would we want to be interchanging backends during the same compilation process? Correct me if I misread this. :-)

yebblies · 2015-04-21T09:24:18Z

Why would we want to be interchanging backends during the same compilation process? Correct me if I misread this. :-)

I mean that CTFE is sort of another backend, etc.

ibuclaw · 2015-07-25T15:52:13Z

Any update on this? I'd like to push for removing struct block from arraytypes.h

yebblies · 2015-07-26T00:39:20Z

I haven't touched it since dconf. I had a big argument with Walter about this, and he basically said that the only way he would accept this is if I can show that with all of the similar changes implemented dmd's performance improves in some measurable way. i.e. showing that the performance difference is negligible is not enough. Also that we should beware of crustaceans and their ill effects on the movement speed of ships.

I did some work on toSymbol but didn't get it to stop segfaulting, and haven't gone near it since dconf.

ibuclaw · 2015-07-26T04:11:41Z

If speed is a problem, maybe convert the AA hash implementation into a template? You could also allow overriding how keys are hashed to be naive for speed (eg: integer and pointer keys don't need hashing).

ibuclaw · 2016-05-04T15:15:41Z

@MartinNowak @yebblies ping.

So, lets get benchmarks on this, and decide if it's the best approach? The only other approach I can think of is to have a synthetic struct pointer, which each visitor in the glue defines locally.

E.g:
toctype.c

struct X {
  type *ctype;
}

tocsym.c

struct X {
  Symbol* stag;
  Symbol* sinit;
}

And so on.

However I think this is the most agreeable suggestion so far.

andralex · 2017-11-20T18:50:44Z

There's been controversy on this, and @WalterBright all but vetoed this approach. However the context has changed since, what with the more widespread use of visitors by @RazvanN7 and others. Thoughts on reviving or reframing this? I'm thinking an opaque pointer is a simple technique that other compilers use as well.

ibuclaw · 2017-11-20T20:30:10Z

It's still useful on my side to have all dmd-specific fields removed.

ibuclaw · 2022-05-01T13:03:51Z

It also saves memory on frontend AST nodes (#13808).

yebblies mentioned this pull request Apr 12, 2015

Unresolved differences between gdc/dmd front ends #2194

Closed

MartinNowak reviewed Apr 15, 2015
View reviewed changes

yebblies force-pushed the ctypeaa branch from 7abc0f2 to c08dd60 Compare March 30, 2016 10:15

yebblies added 17 commits May 7, 2016 21:24

Add back aav.h

d42cc8a

Move Type::ctype into an external AA

e6c8389

Remove most direct access to Dsymbol.csym

3a75376

Replace Dsymbol.csym with an AA

2ae0988

Replace Dsymbol.isym with an AA

6049b70

Replace stag and sinit with AA lookup

d933a39

Replace vtblsym with AA lookup

bfea796

Replace cpp_type_info_ptr_sym with AA lookup

7ffd958

Replace shidden with an AA

02473f4

Move ClassReferenceExp's symbol to an AA

e8a1707

Move some Symbols out of Module

7891350

Move sfilename out of Module

46412e3

Move massert/munittest/marray out of Module

5d6f73c

Remove coverage symbols from Module

7e5094c

Remove Symbol from the frontend

de2bf4a

Move backend members out of AsmStatement

d22ca83

Delete the rest of the backend types

53abff4

yebblies force-pushed the ctypeaa branch from c08dd60 to 53abff4 Compare May 12, 2016 16:11

dlang-bot added Review:Needs Rebase Review:Needs Work Review:stalled labels Jan 1, 2018

ibuclaw self-assigned this Jan 30, 2018

ibuclaw mentioned this pull request Aug 20, 2019

statement_toIR(): simplify conversion of case statements to goto #10322

Merged

dlang-bot added Merge:needs rebase stalled labels Dec 3, 2024

Uh oh!

Comments

Conversation

yebblies commented Apr 7, 2015

Uh oh!

yebblies commented Apr 7, 2015

Uh oh!

MartinNowak commented Apr 10, 2015

Uh oh!

yebblies commented Apr 11, 2015

Uh oh!

MartinNowak commented Apr 11, 2015

Uh oh!

yebblies commented Apr 11, 2015

Uh oh!

MartinNowak commented Apr 14, 2015

Uh oh!

ibuclaw commented Apr 14, 2015

Uh oh!

yebblies commented Apr 14, 2015

Uh oh!

Safety0ff commented Apr 15, 2015

Uh oh!

MartinNowak Apr 15, 2015

Choose a reason for hiding this comment

Uh oh!

yebblies Apr 16, 2015

Choose a reason for hiding this comment

Uh oh!

WalterBright commented Apr 18, 2015

Uh oh!

yebblies commented Apr 18, 2015

Uh oh!

MartinNowak commented Apr 19, 2015

Uh oh!

ibuclaw commented Apr 20, 2015

Uh oh!

yebblies commented Apr 20, 2015

Uh oh!

yebblies commented Apr 20, 2015

Uh oh!

ibuclaw commented Apr 20, 2015

Uh oh!

ibuclaw commented Apr 20, 2015

Uh oh!

MartinNowak commented Apr 20, 2015

Uh oh!

ibuclaw commented Apr 20, 2015

Uh oh!

yebblies commented Apr 20, 2015

Uh oh!

redstar commented Apr 20, 2015

Uh oh!

redstar commented Apr 21, 2015

Uh oh!

yebblies commented Apr 21, 2015

Uh oh!

ibuclaw commented Apr 21, 2015

Uh oh!

yebblies commented Apr 21, 2015

Uh oh!

ibuclaw commented Jul 25, 2015

Uh oh!

yebblies commented Jul 26, 2015

Uh oh!

ibuclaw commented Jul 26, 2015

Uh oh!

ibuclaw commented May 4, 2016

Uh oh!

andralex commented Nov 20, 2017

Uh oh!

ibuclaw commented Nov 20, 2017

Uh oh!

ibuclaw commented May 1, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone