FIX: ignore numpy-style default values in docstrings #210

mazer-ai · 2025-01-23T04:49:39Z

Problem: The numpy format parser from docstring_parser doesn't remove the default value information from type_name, resulting in a mismatch between the name in the function signature and the docstring for any parameters specifying a default value in the docstring.

The numpy style doc is vague about "correct" way to specify default values, this patch handles two common patterns:

Parameters
-----------
foo: Optional[int] = 10
    description of foo here

bar: Optional[int], default is 10
    descriptiopn of bar here

Both of which appear in numpy code and numpy-related projects.

Note that this could also be handled in docstring_parser, but since current pydoclint depends on.a forked version of the package, I opted to handle here it here, where it seemed simpler.

Also, this could be done using a regex to search for and exclude the default info, but poking around, the general consensus seems to be that use of in or .find() for short, static strings is faster than using re.match for
something like this.

Finally, this problem doesn't occur for ReST formatted docstrings, and the specification for default values in google style docstrings is even more poorly defined than for numpy, but from the examples I found on-line, the current parser works fine. So this seems to really just be a numpy issue.

jsh9 · 2025-01-26T04:40:24Z

Thanks! Let me take a look.

mazer-ai · 2025-01-27T07:13:26Z

Shoot -- are those CI errors from my changes? The tests run clean on my local python 3.12 install (macOS ventura).
I'm having a bit of trouble parsing the errors -- seems to have something to do with a \_ escape sequence in test_args.py, but can't figure out how anything I touched would change that.

jsh9 · 2025-01-29T05:31:15Z

pydoclint/utils/doc.py

+            #      bar: int = 10        # noqa: E800
+            for k, metadata in enumerate(self.parsed.meta):
+                if metadata.args[0] == 'param':
+                    # use of `in` can be replaced with a pre-compiled `re`, but


Hi @mazer-ai , could you double check your comment here? Because I don't see the use of in here. Thanks!

Sorry -- originally used in and then replaced with .find so I could get the substring position in one go.. Both seem faster than trying to use an regex here.. Will update comments.

jsh9 · 2025-01-29T05:32:41Z

Hi @mazer-ai , could you add some test cases to check your code changes in this PR? Thank you!

jsh9 · 2025-01-29T05:36:45Z

pydoclint/utils/doc.py

+            #   supports a couple different specs:
+            #      Parameters
+            #      ----------
+            #      foo: int, default 10


Can we support all the 3 styles mentioned here?

And could you also add a note in the documentation (at least in docs/notes_for_users.md, and preferably also in Section 2.7 of README)?

Thanks!

These are all supported. I added some examples to the new test to confirm.

jsh9 · 2025-01-29T05:37:48Z

Shoot -- are those CI errors from my changes? The tests run clean on my local python 3.12 install (macOS ventura). I'm having a bit of trouble parsing the errors -- seems to have something to do with a \_ escape sequence in test_args.py, but can't figure out how anything I touched would change that.

Hi @mazer-ai , after you make changes, you can run pre-commit run --all-files to auto-format the code, and also run tox to check the CI pipeline locally. (You'd need to install pre-commit and tox in your development environment.)

mazer-ai · 2025-01-30T01:30:20Z

Shoot -- are those CI errors from my changes? The tests run clean on my local python 3.12 install (macOS ventura). I'm having a bit of trouble parsing the errors -- seems to have something to do with a \_ escape sequence in test_args.py, but can't figure out how anything I touched would change that.

Hi @mazer-ai , after you make changes, you can run pre-commit run --all-files to auto-format the code, and also run tox to check the CI pipeline locally. (You'd need to install pre-commit and tox in your development environment.)

@jsh9 When I run tox locally, I get those same errors from test_args.py (sorry, first time I've used tox -- used to just running pytest by hand):

tests/utils/test_arg.py
<unknown>:19: SyntaxWarning: invalid escape sequence '\_'
<unknown>:208: SyntaxWarning: invalid escape sequence '\_'
<unknown>:209: SyntaxWarning: invalid escape sequence '\_'
<unknown>:322: SyntaxWarning: invalid escape sequence '\_'
<unknown>:323: SyntaxWarning: invalid escape sequence '\_'

Not sure why these escape sequences are present in test_args.py, but they seem to be illegal escape sequences:

mazer@bridger $ python
Python 3.12.4 (main, Jun  6 2024, 18:26:44) [Clang 15.0.0 (clang-1500.1.0.2.5)] on darwin
Type "help", "copyright", "credits" or "license" for more information.
>>> 'arg1\_\_'
<stdin>:1: SyntaxWarning: invalid escape sequence '\_'
'arg1\\_\\_'
>>>

Any idea if I'm missing something obvious here? Or is this something strange about my dev env?

Sorry about turning something simple into something complicated...

jsh9 · 2025-01-30T07:18:57Z

Shoot -- are those CI errors from my changes? The tests run clean on my local python 3.12 install (macOS ventura). I'm having a bit of trouble parsing the errors -- seems to have something to do with a \_ escape sequence in test_args.py, but can't figure out how anything I touched would change that.

Hi @mazer-ai , after you make changes, you can run pre-commit run --all-files to auto-format the code, and also run tox to check the CI pipeline locally. (You'd need to install pre-commit and tox in your development environment.)

@jsh9 When I run tox locally, I get those same errors from test_args.py (sorry, first time I've used tox -- used to just running pytest by hand):
tests/utils/test_arg.py
<unknown>:19: SyntaxWarning: invalid escape sequence '\_'
<unknown>:208: SyntaxWarning: invalid escape sequence '\_'
<unknown>:209: SyntaxWarning: invalid escape sequence '\_'
<unknown>:322: SyntaxWarning: invalid escape sequence '\_'
<unknown>:323: SyntaxWarning: invalid escape sequence '\_'
Not sure why these escape sequences are present in test_args.py, but they seem to be illegal escape sequences:
mazer@bridger $ python
Python 3.12.4 (main, Jun  6 2024, 18:26:44) [Clang 15.0.0 (clang-1500.1.0.2.5)] on darwin
Type "help", "copyright", "credits" or "license" for more information.
>>> 'arg1\_\_'
<stdin>:1: SyntaxWarning: invalid escape sequence '\_'
'arg1\\_\\_'
>>>
Any idea if I'm missing something obvious here? Or is this something strange about my dev env?

Sorry about turning something simple into something complicated...

Hi, don't worry about those warnings. Those existed before your PR.

mazer-ai · 2025-01-30T22:15:00Z

@jsh9 I think this should address your comments. Let me know if you see anything else.

FIX: ignore numpy-style default values in docstrings

4b06492

jsh9 added 4 commits January 29, 2025 00:26

Auto-format code

377539d

Fix E800 violation

32dcfa6

Fix mypy violations

79f2124

Remove redundant blank line

46aa2d7

jsh9 reviewed Jan 29, 2025

View reviewed changes

mazer-ai added 3 commits January 30, 2025 13:51

Add tests for ignoring defaults in numpy-style docstrings

195de9d

Removed obsolete comment about use of in

269fef5

Add info about default values in docstrings to docs

d7f6e26

mazer-ai requested a review from jsh9 January 30, 2025 22:15

Fix typo

8793593

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FIX: ignore numpy-style default values in docstrings #210

FIX: ignore numpy-style default values in docstrings #210

mazer-ai commented Jan 23, 2025 •

edited

Loading

jsh9 commented Jan 26, 2025

mazer-ai commented Jan 27, 2025

jsh9 Jan 29, 2025

mazer-ai Jan 29, 2025

jsh9 commented Jan 29, 2025

jsh9 Jan 29, 2025

mazer-ai Jan 30, 2025

jsh9 commented Jan 29, 2025

mazer-ai commented Jan 30, 2025 •

edited

Loading

jsh9 commented Jan 30, 2025

mazer-ai commented Jan 30, 2025

FIX: ignore numpy-style default values in docstrings #210

Are you sure you want to change the base?

FIX: ignore numpy-style default values in docstrings #210

Conversation

mazer-ai commented Jan 23, 2025 • edited Loading

jsh9 commented Jan 26, 2025

mazer-ai commented Jan 27, 2025

jsh9 Jan 29, 2025

Choose a reason for hiding this comment

mazer-ai Jan 29, 2025

Choose a reason for hiding this comment

jsh9 commented Jan 29, 2025

jsh9 Jan 29, 2025

Choose a reason for hiding this comment

mazer-ai Jan 30, 2025

Choose a reason for hiding this comment

jsh9 commented Jan 29, 2025

mazer-ai commented Jan 30, 2025 • edited Loading

jsh9 commented Jan 30, 2025

mazer-ai commented Jan 30, 2025

mazer-ai commented Jan 23, 2025 •

edited

Loading

mazer-ai commented Jan 30, 2025 •

edited

Loading