-
-
Notifications
You must be signed in to change notification settings - Fork 19.4k
BUG: Inconsistent behavior of Groupby with None values with filter (#… #63178
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
BUG: Inconsistent behavior of Groupby with None values with filter (#… #63178
Conversation
rhshadrach
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for the PR! Please always add tests. Does this also handle the tuple case on L667?
edd8a1f to
8d2126a
Compare
|
@koskampt - I opened #63202 to give some idea of what I'm thinking. If you like that, can incorporate it here. But still open to alternative solutions that do not iterate through Even with such a solution, will still want to see the result of running the groupby ASVs to evaluate performance impact. I can also help assist here if desired. |
|
@rhshadrach I had a look at your pull request and incorporated your suggestions in mine. I also made the change I am not familiar with the (groupby) ASVs, but I guess it referring to this: https://pandas.pydata.org/community/benchmarks.html. Help would be greatly appreciated, although I will through the docs by myself first. |
rhshadrach
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I am not familiar with the (groupby) ASVs, but I guess it referring to this: https://pandas.pydata.org/community/benchmarks.html. Help would be greatly appreciated, although I will through the docs by myself first.
Correct - if you're using conda for your virtual environment, then this should be sufficient:
asv continuous -f 1.1 upstream/main HEAD -b ^groupby
…andas-dev#62501) - Add test cases - Add tuple support - Incorporate feedback
BUG: Inconsistent behavior of Groupby with None values with filter
2ef342b to
f7c5e23
Compare
|
I was able to get the asv up and running (a couple of days ago). I will run the benchmark with the below command and report back the results.
|
|
Just checking, I also went through |
|
@rhshadrach, I was able to run asv with the command you mentioned. During the benchmark run I did not use my computer. The results can be found below:
SOME BENCHMARKS HAVE CHANGED SIGNIFICANTLY. |
Indeed, in addition to
So seems like a pretty limited surface area for performance impact, and I do not see a more performant way to do this that would be limited in scope. Would like another eye here - cc @jbrockmendel. |
…62501)
doc/source/whatsnew/v2.3.4.rstfile if fixing a bug or adding a new feature.