Fix Thread leaking in ThreadLocalVariables #164

pitr-ch · 2014-10-05T14:01:07Z

introduce #bind method to avoid value leaks
warn on using just #value=

elspethsoup · 2014-10-08T16:01:11Z

The part I don't understand is what is the harm in using the mechanism provided by ruby as the underlying store outside of jruby like the previous implementation? Switching from using symbols to the object_id of the ThreadLocalVar instance as the key seems like it would do that trick nicely with Thread#[] and Thread#thread_variable_get.

chrisseaton · 2014-10-08T16:24:40Z

I've been looking for the original discussions, but can't find them at the moment. There was a lot of back-and-forth about this and it's changed several times, but I don't think we've arrived at the right solution yet. Perhaps we all need to have a think, arrive at a consensus carefully, and then code it up.

chrisseaton · 2014-10-08T16:28:03Z

This was part of the discussion #22.

I'm thinking maybe we should use Thread#[], use the ThreadLocalVar as the key, but via WeakRef, and occasionally prune Thread#[] for collected ThreadLocalVars.

pitr-ch · 2014-10-09T06:56:03Z

@soupmatt I wanted to do something like that too, but:

[1] pry(main)> Thread.current[123] = 'asd'
TypeError: 123 is not a symbol

It accepts only un-collectable Symbols as a key :/ (Strings are converted to symbols silently)

chrisseaton · 2014-10-09T08:25:05Z

Ah yes - that was the key problem. Maybe we should just give up and get the programmer to specify their own symbol in the constructor?

pitr-ch · 2014-10-09T08:41:12Z

I somewhat liked the current approach because then user is warned. (Hm, I should have probably put there a better warning message with better explanation, how and when the value leaks.) But the more I am thinking about this the more I believe I should find time to implement a WeakMap where only keys are weak, then a ThreadLocalVar would hold have this WeakMap with the values and Threads as keys. That should fix it.

elspethsoup · 2014-10-09T18:59:58Z

@pitr-ch, the ref gem has a WeakKeyMap that might inform a possible implementation using weak references. You could use the Threads themselves as the keys, and then the values will get released when the keys are finalized.

- introduce #bind method to avoid value leaks - warn on using just #value=

pitr-ch · 2014-10-10T19:34:27Z

@soupmatt thanks for the link! I've looked at the implementation and it already does what I was about to write. (It verifies that the object_id was not reallocated for different object as well which I saw being an issue in similar lib.)

We are very carefully about adding dependencies, so far we have none. So we need to decide what to do next?

include dependency on ref
reimplement same weak_key_map in concurrent-ruby
do no add dependency into spec, just failing when ThreadLocalVar is required warning that ref needs to be installed, implies that ThreadLocalVar will become optional part of concurrent-ruby

cc @jdantonio, @mighe, @chrisseaton, @lucasallan

jdantonio · 2014-10-13T13:26:10Z

@pitr-ch Thanks for looking into this and for the great suggestions. As you can probably guess I'm not a big fan of including other gems as dependencies. I'm not sure how long that position will remain pragmatic, but it is a slippery slope. Without that rule we would have already added dependencies on ruby-atomic and redcard at least. That being said, we want our code to be reliable. I'm comfortable with the third suggestion: optional requirement. But rather than failing if ref hasn't been required could we instead display a warning and use the current implementation? If I understand the situation (and I'll admit I'm not 100% sure I do) then it seems that the leak is very minor and may have no appreciable effect in most cases.

pitr-ch · 2014-10-20T08:02:46Z

I think the leak is not that minor, let me provide an example: User will have local thread variable (e.g. current_user) and his application will be created Threads per request assigning a current user within the thread, then all the values assigned in the threads will leak. I've intentionally chosen storage on variable so it leaks value per thread not value per variable which would be happening for storage on thread. This comes from assumption that creating of many threads is bad, threads should be reused.

Therefore I would rather fix this or disable ThreadLocalVariable when ref is not loaded. Even with warning it could lead to unpleasant surprises.

pitr-ch · 2014-11-08T16:23:00Z

Fixed to use ref gem to avoid any leaks. It is not required by default. It fails on MRI where there is no ref gem installed. I did not include a fallback version to avoid any nasty surprise all together. Please review.

ThreadLocalVar needs to be required manually.

coveralls · 2014-11-08T16:35:03Z

Coverage decreased (-0.05%) when pulling 096a10c on thread-local-var into 9513562 on master.

jdantonio · 2014-11-09T15:10:59Z

lib/concurrent/atomic/thread_local_var.rb

Instead of raising an error and refusing to work, could we instead output a warning and fall back to the original, degraded implementation? Leave it to the individual user to decide if they want to use the ref gem.

This is the approach the elasticsearch gem uses to enable persistent connections. It detects the presence of either typhoeus or patron and uses that gem when available. Without either of those optional dependencies the elasicesearch gem still works, it just doesn't perform as well.

To me a memory leak means broken not just degraded implementation I would rather not provide it at all. As an user when I miss the warning about memory leak ending up debugging it for hours I would be much more frustrated that adding one dependency at the beginning. From my experience I really like libraries which are failing early explaining the problem. If you really want to avoid the dependency I could reimplement the weak_key_map for concurrent-ruby but than we'll have to maintain it.

If this is a bug and not just degraded functionality then we need to fix it. I have mixed feelings about the Ref gem. For pragmatism and expedience I'll probably suggest that for now we use Ref, but let me explain in more detail my concerns about adding dependencies to gems such as our. Adding dependencies to utility gems can lead to four problems, all of which I have experienced in production systems:

Downstream incompatibilities: Once gem depends on other gems which depend on other gems and so on. Adding one gem dependency results in a tree of several gem dependencies being added. Incompatibilities then occur downstream. (This is the exact problem Bundler was created to solve).

Code bloat: A gem dependency is added for one class/function, but the dependency gem has thousands of lines of code (internally and in its dependencies). That one class/function unnecessarily bloats the code base.

Compilation errors: I work in a cross-platform shop and we regularly have problems with C extension which don't compile. This is especially problematic when I require a pure-Ruby gem that has a downstream dependency which uses C extensions. I'be only recently started working with Java native extensions but I presume this can be a problem for JRuby, too. (This is why I pre-compile various builds of this gem.)

Abandonment: A dependency is added but the maintainers stop maintaining it.

The Ref gem passes two of these four metrics. It doesn't have any downstream dependencies and it is a very lean gem, so it's good for items 1 and 2 above. Not so much for items 3 and 4. With respect to item 3, Ref includes Java native extensions but it does not pre-compile those extensions. Therefore we cannot guarantee that any particular user of ours will be able to successfully install our gem on JRuby. With respect to item 4, Ref has not been updated since May of 2013. There are six open issues dating back to April 2012 and the maintainer has only responded to one of them. There is a PR that has been open since August of 2013 and the maintainer has not responded. As far as I can tell Ref has been abandoned by its maintainer.

I appreciate that it would be a bunch of extra work for us to implement our own weak key map, but we need to maintain the integrity of our own gem.

So I'm OK if we want to include Ref for now, but the best long-term solution is probably to build our own weak key map. Perhaps we can make that a goal for 1.0.

I may also reach out to the gem author. If Ref is no longer being actively maintained we may have an opportunity to bring it into the ruby-concurrency organization.

It's a very good point the 4th one. I agree we should either reimplement or pull the gem under concurrent-ruby in long-term. Technically it passes 3 because on JRuby the gem is not being loaded so gem 'ref', platform: :mri will avoid any problems with compilation on JRuby, that should make the short-term solution even more viable.

Good point about item 3. Even though we won't be using Ref in our JRuby gem, it will still be installed if we list the dependency in the gemspec. Compilation would fail on install even if we never load Ref at runtime. But now that I think it through, we can easily solve this. Since it isn't being used under JRuby I can added a guard in our gemspec to not require Ref when we create our pre-compiled JRuby build. Then Ref will never be installed.

I'll make the update on this branch later this evening.

This commit is only viable because we create a JRuby-specific gem build. The conditional statement in the gemspec file is not processed at runtime. It is only processed when the gem is built. Therefore it *should* prevent the `Ref` gem from being installed when using the JRuby-specific build. This has not been tested, however.

jdantonio · 2014-11-11T14:53:27Z

concurrent-ruby.gemspec

This commit is only viable because we create a JRuby-specific gem build. The conditional statement in the gemspec file is not processed at runtime. It is only processed when the gem is built. Therefore it should prevent the Ref gem from being installed when using the JRuby-specific build. This has not been tested, however.

Looks good to me 👍 I think we can merge this PR after you test the build.

Fix Thread leaking in ThreadLocalVariables Tested the exclusion of the `Ref` gem in the pre-compiled Java build (using our automated build process) and it worked as intended. The `Ref` gem will not be installed under JRuby when using the precompiled Java build.

jdantonio · 2014-11-24T18:27:21Z

@pitr-ch Ever since we've merged this PR, our builds have consistently failed on MRI 1.9.3. The test Concurrent::ThreadLocalVar GC does not leave values behind when bind is not used always fails with the same result, ["expected: == 1 got: 101"](Concurrent::ThreadLocalVar GC does not leave values behind when bind is not used). I just discovered this today so I haven't had a chance to look at the code. Any idea what might be happening?

pitr-ch · 2014-11-26T20:33:55Z

@jdantonio yeah this line 7d33d7e#diff-1de73cb618a4dedaa5d744e1dba7efc9R60 is not reliably triggering GC run. I'll try to figure something out.

pitr-ch mentioned this pull request Oct 5, 2014

Concurrent::ThreadLocalVar leaks threads #163

Closed

pitr-ch force-pushed the thread-local-var branch 2 times, most recently from 4be7c50 to 37cc349 Compare October 5, 2014 15:03

Fix Thread leaking in ThreadLocalVariables

da60120

- introduce #bind method to avoid value leaks - warn on using just #value=

pitr-ch force-pushed the thread-local-var branch 3 times, most recently from 35fd4a8 to 096a10c Compare November 8, 2014 16:22

Fixing ThreadLocalVar to use 'ref' gem

7d33d7e

ThreadLocalVar needs to be required manually.

pitr-ch force-pushed the thread-local-var branch from 096a10c to 7d33d7e Compare November 8, 2014 16:33

jdantonio reviewed Nov 9, 2014
View reviewed changes

jdantonio reviewed Nov 11, 2014
View reviewed changes

Removed Ref from the Gemfile.

1159d5a

pitr-ch mentioned this pull request Nov 12, 2014

Remove dependency on 'ref' gem #183

Closed

jdantonio merged commit 8f15a3e into master Nov 14, 2014

jdantonio deleted the thread-local-var branch November 14, 2014 12:41

pitr-ch restored the thread-local-var branch November 26, 2014 20:42

pitr-ch mentioned this pull request Nov 26, 2014

Count with GC may fail to run #196

Merged

eregon mentioned this pull request Jan 11, 2023

Fix ReentrantReadWriteLock implementation when Mutex is per-fiber. #983

Merged

Fix Thread leaking in ThreadLocalVariables #164

Fix Thread leaking in ThreadLocalVariables #164

Uh oh!

Conversation

pitr-ch commented Oct 5, 2014

Uh oh!

elspethsoup commented Oct 8, 2014

Uh oh!

chrisseaton commented Oct 8, 2014

Uh oh!

chrisseaton commented Oct 8, 2014

Uh oh!

pitr-ch commented Oct 9, 2014

Uh oh!

chrisseaton commented Oct 9, 2014

Uh oh!

pitr-ch commented Oct 9, 2014

Uh oh!

elspethsoup commented Oct 9, 2014

Uh oh!

pitr-ch commented Oct 10, 2014

Uh oh!

jdantonio commented Oct 13, 2014

Uh oh!

pitr-ch commented Oct 20, 2014

Uh oh!

pitr-ch commented Nov 8, 2014

Uh oh!

coveralls commented Nov 8, 2014

Uh oh!

jdantonio Nov 9, 2014

Choose a reason for hiding this comment

Uh oh!

pitr-ch Nov 10, 2014

Choose a reason for hiding this comment

Uh oh!

jdantonio Nov 10, 2014

Choose a reason for hiding this comment

Uh oh!

pitr-ch Nov 10, 2014

Choose a reason for hiding this comment

Uh oh!

jdantonio Nov 10, 2014

Choose a reason for hiding this comment

Uh oh!

jdantonio Nov 11, 2014

Choose a reason for hiding this comment

Uh oh!

pitr-ch Nov 12, 2014

Choose a reason for hiding this comment

Uh oh!

jdantonio commented Nov 24, 2014

Uh oh!

pitr-ch commented Nov 26, 2014

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants