Cache path matchers #3

donboie · 2018-02-26T19:44:50Z

No description provided.

+ removed restriction on non local tag queries on Cache writing.

+ This allows us to terminate the recursion when querying set names or the set membership.

- move tag functionality to SceneInterface and express in terms of sets.
- added getRoot to scene interface to query the top level SceneInterface.
- removed testWritingOnFlushedFiles as child SceneInterfaces always hold references to the root

johnhaddon

Hey Don,

Gonna just put all my thoughts here rather than mix GitHub with email. I haven't looked at everything, in fact I've only looked at the SceneInterface and SceneCache changes and then stopped, because there seems to be more than enough to talk about with just those. I'm a bit under the weather, so I might be misreading things, but if I'm not, I'm already concerned about both the performance and the clarity of the code. If my cold-addled brain is worrying about nothing, please forgive me...

My approach has been to store tags as sets on the root scene interface and make the tag API concrete on the SceneInterface and implement it as querying sets on the root. This requires a pointer to the root SceneInterface from any SceneInterface. I liked the idea of new SceneInterfaces just supporting the set API and the tags would just work.

I found that idea appealing too at first, but I want to at least play devil's advocate. My concerns are :

- It doesn't really buy us anything. As we move to new formats, we'll be moving to sets anyway.
- It totally reimplements a feature that is still really important at Image Engine, and is important for performance. Maybe you have benchmarks to prove otherwise, but I find it hard to believe that simple queries like hasTag() and readTag() have the same performance as before. My understanding is that they're now forced to construct sets (maybe even entire sets?) whereas before they were definitely cheap.
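To make the performance concern above concrete, here is a minimal sketch of the two storage shapes being compared. It is an illustration only and does not show how Cortex actually stores or reads tags: `LocalTags`, `RootSets`, `Path` and `PathSet` are hypothetical stand-ins, not the real `SceneInterface` or `PathMatcher` types.

```cpp
#include <cassert>
#include <map>
#include <set>
#include <string>
#include <vector>

// Hypothetical stand-ins for illustration; not the Cortex API.
using Path = std::vector<std::string>;
using PathSet = std::set<Path>;

// Tag-style storage: each location keeps only its own tag names.
struct LocalTags
{
	std::set<std::string> tags;

	bool hasTag( const std::string &name ) const
	{
		// A single lookup in a small, local container.
		return tags.count( name ) > 0;
	}
};

// Set-style storage: the root keeps, per set name, every member path.
struct RootSets
{
	std::map<std::string, PathSet> sets;

	bool hasTag( const Path &location, const std::string &name ) const
	{
		// The query depends on the whole named set being available at the root.
		auto it = sets.find( name );
		return it != sets.end() && it->second.count( location ) > 0;
	}
};

// Small usage example: both shapes answer the same question.
void tagQueryExample()
{
	LocalTags local;
	local.tags.insert( "render" );
	assert( local.hasTag( "render" ) );

	RootSets root;
	root.sets["render"].insert( Path{ "a", "b" } );
	assert( root.hasTag( Path{ "a", "b" }, "render" ) );
	assert( !root.hasTag( Path{ "a" }, "render" ) );
}
```

In the second shape the cost of a single query is tied to loading or building the named set, which is the difference a benchmark would need to measure.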

My main problem is with the SceneInterface pointer to the root SceneInterface and ensuring this is const correct.

I'm hoping that's achievable by dropping m_root and just using SceneInterface::scene(), but presumably it's not? I do think we have bigger problems to address first though. I think it'd be beneficial for us to step back a bit to correct any misconceptions I might have and/or address my concerns about the SceneCache changes. I'm not sure I really understand how the tag->set conversion is happening, or the performance implications...

Cheers...
John

johnhaddon · 2018-02-27T15:39:48Z

include/IECoreScene/SceneInterface.h

 		static void registerCreator( const std::string &extension, IECore::IndexedIO::OpenMode modes, CreatorFn f );

+	private:
+		SceneInterfacePtr m_root;


SceneInterface is meant to be an abstract class, so this feels wrong.

SceneInterface is meant to be an abstract class, so this feels wrong.

Do you mean it's supposed to be an interface class with no storage and only pure virtual function signatures? If it did have this property what are the benefits?

Do you mean it's supposed to be an interface class with no storage and only pure virtual function signatures?

Yes.

If it did have this property what are the benefits?

For me, trying to understand the code, one huge benefit would be clarity. I find the various SceneInterfaces are already a bit mind boggling, since they often have multiple internal implementation classes each with their own relationships. Pushing those relationships out into the base class really doesn't help.

Might this "all children have ownership of the root" setup also preclude certain implementations from storing their children for fast child() queries? Because it would force a circular reference? I have a nasty feeling I've had to avoid this pattern in the past for those reasons. Baking in assumptions about how these subclasses must be implemented seems wrong, especially because it's only to provide short term compatibility for tags. It's basically turning a hack into a structural feature.

Finally I guess it's the fact that we've been heading in a nice direction where we don't even expose the headers for our derived classes, so the implementation is all nicely hidden and changeable without breaking major version. This kind of breaks that.

johnhaddon · 2018-02-27T15:40:02Z

include/IECoreScene/SceneInterface.h

 		/// Returns a const interface for querying the scene at the given path (full path).
 		virtual ConstSceneInterfacePtr scene( const Path &path, MissingBehaviour missingBehaviour = ThrowIfMissing ) const = 0;

+		/// Returns the root SceneInteface for this heirahcy.


heirahcy -> hierarchy.

johnhaddon · 2018-02-27T15:42:36Z

include/IECoreScene/SceneInterface.h

 		virtual ConstSceneInterfacePtr scene( const Path &path, MissingBehaviour missingBehaviour = ThrowIfMissing ) const = 0;

+		/// Returns the root SceneInteface for this heirahcy.
+		SceneInterfacePtr getRoot() const { return m_root; }


As I've explained before, in Cortex and Gaffer, set/get are always paired and imply mutability of a member, so this should just be root(), since it's immutable. That said, I don't get the need for this method - how is it different to calling this->scene( Path() )?

As I've explained before, in Cortex and Gaffer, set/get are always paired and imply mutability of a member.

Let's hope you never have to mention this again m _ _ m.

That said, I don't get the need for this method - how is it different to calling this->scene( Path() )?

I added this for the reading tags in the LinkedScene case. What happens when I want to read a tag on a linked scene which was saved with v10?

I'm afraid I don't understand the details here. But lower down you say "Looks like I can do this - thanks for pointing it out.", in reference to the same question, so I'm a bit confused.

I'm afraid I don't understand the details here. But lower down you say "Looks like I can do this - thanks for pointing it out.", in reference to the same question, so I'm a bit confused.

I took another look at this->scene( Path() ) and I don't need to keep a root SceneInterface. We do require every class derived from SceneInterface to do something like keeping reference to the root or something similar.

I took another look at this->scene( Path() ) and I don't need to keep a root SceneInterface.

Ah, good! Does that also take care of your const-correctness issues, since there's a const and a non-const scene(), or is that separate?

We do require every class derived from SceneInterface to do something like keeping reference to the root or something similar.

Yep, they need something similar in order to implement scene(). But they've all avoided using a SceneInterfacePtr to the root, even though that might have been the obvious choice, so I do wonder if that was for good reason.

Ah, good! Does that also take care of your const-correctness issues, since there's a const and a non-const scene(), or is that separate?

I think this will be OK, but as you've pointed out there are more important issues I should address first.

Yep, they need something similar in order to implement scene(). But they've all avoided using a SceneInterfacePtr to the root, even though that might have been the obvious choice, so I do wonder if that was for good reason.

Each concrete implementation of SceneInterface needs to either store some shared data or a pointer to its parent. For ::scene to be implemented in SceneCache we need to walk parent pointers back to the root.

cortex/src/IECoreScene/SceneCache.cpp

Line 585 in e160d55

ReaderImplementation *root = this;
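The parent walk referred to above can be sketched as follows. This is a simplified illustration, not the SceneCache code: `Location` is a hypothetical type, and the non-owning raw `parent` pointer is an assumption made for the sketch.

```cpp
#include <cassert>
#include <string>

// Hypothetical location type; not the real ReaderImplementation.
struct Location
{
	std::string name;
	const Location *parent; // nullptr at the root; non-owning

	// Follows parent pointers until reaching the location that has no parent.
	const Location *root() const
	{
		const Location *result = this;
		while( result->parent )
		{
			result = result->parent;
		}
		return result;
	}
};

// Small usage example: every location in the chain resolves to the same root.
void rootWalkExample()
{
	Location top{ "/", nullptr };
	Location a{ "a", &top };
	Location b{ "b", &a };
	assert( b.root() == &top );
	assert( a.root() == &top );
	assert( top.root() == &top );
}
```

Because the child only points upwards without owning anything, the root can be found on demand and no extra root member is needed on each location.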

Yep, but for instance, Alembic Scene stores things the other way, and each location owns a map of its children :

https://github.com/ImageEngine/cortex/blob/master/contrib/IECoreAlembic/src/IECoreAlembic/AlembicScene.cpp#L555

Now it's not storing a map of SceneInterfacePtr so I think it would be OK, but if it did want to (and that doesn't seem unreasonable), then the root pointer would preclude that because it would cause a circular reference...
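The circular reference described above can be reproduced in a few lines. This is a sketch under assumptions: `std::shared_ptr` stands in for Cortex's reference-counted pointers, and `Node` and `rootIsFreed` are hypothetical names rather than anything in AlembicScene or SceneCache.

```cpp
#include <cassert>
#include <map>
#include <memory>
#include <string>

// Hypothetical node that owns its children and, optionally, its root.
struct Node
{
	std::shared_ptr<Node> root; // owning pointer back to the root
	std::map<std::string, std::shared_ptr<Node>> children; // owning pointers to children
};

// Returns true if the root is destroyed once all external references are dropped.
bool rootIsFreed( bool childOwnsRoot )
{
	auto root = std::make_shared<Node>();
	auto child = std::make_shared<Node>();
	root->children["child"] = child;
	if( childOwnsRoot )
	{
		// root owns child and child owns root: a reference cycle.
		child->root = root;
	}

	std::weak_ptr<Node> watcher = root; // observes the root without owning it
	child.reset();
	root.reset();
	return watcher.expired();
}

// Small usage example contrasting the two ownership layouts.
void cycleExample()
{
	assert( rootIsFreed( false ) ); // no back pointer: root is destroyed
	assert( !rootIsFreed( true ) ); // owning back pointer: root is kept alive by the cycle
}
```

With the owning back pointer in place, the two nodes keep each other alive after every external reference is gone, which is why a child map of owning pointers and an owning root pointer cannot safely coexist.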

johnhaddon · 2018-02-27T16:03:36Z