feat: `astype`: accept a kind of data type #848

lucascolley · 2024-10-24T14:18:27Z

#841 (comment) @rgommers

src/array_api_stubs/_draft/data_type_functions.py

rgommers

Thanks for getting the ball rolling on this @lucascolley! Some comments inline.

src/array_api_stubs/_draft/data_type_functions.py

rgommers · 2024-10-26T05:13:20Z

src/array_api_stubs/_draft/data_type_functions.py

+        For ``dtype_or_kind`` a data type, an array having the specified data type.
+        For ``dtype_or_kind`` a kind of data type:
+        -   If ``x.dtype`` is already of that kind, the data type is maintained.
+        -   Otherwise, an attempt is made to convert to the specified kind, according to the type promotion rules (see :ref:`type-promotion`).


Why "an attempt"? That seems ambiguous. We have to be clear about what must work. Which I think is:

- float to complex
- unsigned to signed integer

I don't think anything else has to. There's no point in allowing 'bool' either, since there is only one boolean dtype, so `dtype=xp.bool` will be cleaner.

For 'signed integer' and 'real floating-point' there are also no promotion rules to follow, so they can be left out. Or do you see a use case?

I've reduced this down to just 'complex floating' (use-case: mixed float/complex to complex) and 'signed integer' (use-case: mixed signed/unsigned to signed).

I think "an attempt" would still be accurate for an implementation of this? `xp.astype(some_int8_array, 'complex floating')` would attempt a conversion, whose success will depend on the implementation-specific type promotion rules, right?

Unless you think that this function should always error when the type promotion is not defined by the standard?
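For readers following along, the "mixed float/complex to complex" use case can be approximated today without the proposed kind argument. The following is a minimal sketch using NumPy as a stand-in for an array library; `to_complex` is a hypothetical helper name, not part of NumPy or the standard.

```python
import numpy as np

def to_complex(x):
    """Hypothetical helper: return x with a complex dtype of matching precision."""
    if np.issubdtype(x.dtype, np.complexfloating):
        return x  # already complex: keep the dtype as-is
    # Promote with the lowest-precision complex dtype to choose the target.
    target = np.result_type(x.dtype, np.complex64)
    return x.astype(target)

print(to_complex(np.ones(3, dtype=np.float32)).dtype)  # complex64
print(to_complex(np.ones(3, dtype=np.float64)).dtype)  # complex128
```

For integer input this helper returns whatever NumPy's own promotion rules produce, which is exactly the implementation-specific case being discussed here.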

> I think "an attempt" would still be accurate for an implementation of this?

I think you have the right idea in mind here; this is just about the language we use to specify things. We specify which behavior has to be supported: 'complex floating' has type promotion rules defined in the standard, so it's expected to always work for a compliant implementation. Then, if we expect other input types to raise, we specify that with "must raise ..." or "input type must be ...". In this case there's no reason to do that (implementors are free to support more types, it's just not standardized), so we say "input type should be ...".

Your "attempt to ..." seems to mean the same as "should be ..."; we just want to word it uniformly.

How about the wording now?

asmeurer · 2024-11-27T18:55:25Z

src/array_api_stubs/_draft/data_type_functions.py

+        -   If ``x.dtype`` is already of that kind, the data type must be maintained.
+        -   Otherwise, ``x`` should be cast to a data type of that kind, according to the type promotion rules (see :ref:`type-promotion`) and the above notes.
+        -   Kinds must be interpreted as the lowest-precision standard data type of that kind for the purposes of type promotion. For example, ``astype(x, 'complex floating')`` will return an array with the data type ``complex64`` when ``x.dtype`` is ``float32``, since ``complex64`` is the result of promoting ``float32`` with the lowest-precision standard complex data type, ``complex64``.
+        -   Where type promotion is unspecified and thus implementation-specific, the result is also unspecified. For example, ``astype(x, 'complex floating')``, where ``x`` has data type ``int32``.


This is overly restrictive, and conflicts with the behavior specified above. The whole point of astype is to be able to do casts that aren't in the promotion table, like int -> float casts. In fact, I would say this whole line should be deleted as the above note already clearly talks about what is and isn't defined.

> This is overly restrictive, and conflicts with the behavior specified above. The whole point of astype is to be able to do casts that aren't in the promotion table, like int -> float casts

I think there is a misunderstanding - this point applies only in the case that a kind of data type is provided, as this is under the bullet list started on line 63.

Ralf suggested in #848 (comment) that it doesn't make sense to support kinds for which the resulting dtype would always be undefined, given that there are no type promotion rules to follow. Do you disagree?
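To make the kind semantics proposed in the diff concrete, here is a small pure-Python model of how the result data type would be chosen. It is only a sketch: dtypes are modeled as plain strings, `resolve_kind` is a hypothetical name, and raising `TypeError` for the unspecified cases is one possible choice (the draft wording leaves that result unspecified rather than requiring an error).

```python
# Illustrative model only: dtypes are plain strings, not real dtype objects.
COMPLEX_DTYPES = {"complex64", "complex128"}
SIGNED_DTYPES = {"int8", "int16", "int32", "int64"}

# Results of promoting with the lowest-precision dtype of each kind,
# limited to the combinations the standard's promotion rules define.
PROMOTE_WITH_COMPLEX64 = {"float32": "complex64", "float64": "complex128"}
PROMOTE_WITH_INT8 = {"uint8": "int16", "uint16": "int32", "uint32": "int64"}

def resolve_kind(dtype, kind):
    """Hypothetical helper: result dtype of astype(x, kind) for x.dtype == dtype."""
    if kind == "complex floating":
        members, table = COMPLEX_DTYPES, PROMOTE_WITH_COMPLEX64
    elif kind == "signed integer":
        members, table = SIGNED_DTYPES, PROMOTE_WITH_INT8
    else:
        raise ValueError(f"unsupported kind: {kind!r}")
    if dtype in members:
        return dtype  # already of that kind: the dtype is maintained
    if dtype in table:
        return table[dtype]  # defined by the type promotion rules
    # Assumption: this sketch raises; the draft only says "unspecified".
    raise TypeError(f"promotion of {dtype} to {kind!r} is unspecified")

print(resolve_kind("float32", "complex floating"))  # complex64
print(resolve_kind("uint16", "signed integer"))     # int32
```

In this model `resolve_kind("int32", "complex floating")` and `resolve_kind("uint64", "signed integer")` both raise, since neither promotion is defined by the standard.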

asmeurer · 2024-11-27T18:56:26Z

src/array_api_stubs/_draft/data_type_functions.py

+
+        -   If ``x.dtype`` is already of that kind, the data type must be maintained.
+        -   Otherwise, ``x`` should be cast to a data type of that kind, according to the type promotion rules (see :ref:`type-promotion`) and the above notes.
+        -   Kinds must be interpreted as the lowest-precision standard data type of that kind for the purposes of type promotion. For example, ``astype(x, 'complex floating')`` will return an array with the data type ``complex64`` when ``x.dtype`` is ``float32``, since ``complex64`` is the result of promoting ``float32`` with the lowest-precision standard complex data type, ``complex64``.


The text you've added here seems to only be thinking about the case where the dtype argument is a string, but the dtype being an actual dtype object is also still supported.

Line 62 covers the dtype case. Do you have a suggestion to make it more clear?

lucascolley · 2024-12-13T10:45:24Z

Gentle reminder of the 2024 milestone here!

feat: astype: accept a kind of data type

2a82758

lucascolley commented Oct 24, 2024

lucascolley added 2 commits October 24, 2024 14:24

try to fix docs build

419041b

change signature

302cf1a

lucascolley marked this pull request as ready for review October 24, 2024 14:33

run pre-commit

3186995

rgommers added the API change Changes to existing functions or objects in the API. label Oct 26, 2024

rgommers reviewed Oct 26, 2024

lucascolley added 4 commits October 27, 2024 16:20

address review comments

90aa750

adjust wording

41880c0

add note on unspecified promotion

d4450b3

improve formatting

1b056ad

lucascolley mentioned this pull request Oct 28, 2024

ENH: real and complex dtype functions data-apis/array-api-extra#13

Open

kgryte added this to the v2024 milestone Oct 31, 2024

kgryte self-requested a review October 31, 2024 05:40

kgryte added the Needs Review Pull request which needs review. label Oct 31, 2024

lucascolley mentioned this pull request Nov 26, 2024

How to infer appropriate dtype from uint to int and float to complex? #859

Open

asmeurer reviewed Nov 27, 2024

lucascolley mentioned this pull request Dec 9, 2024

ENH: signal.vectorstrength: add array API standard support scipy/scipy#22008

Draft

lucascolley commented Oct 24, 2024

rgommers left a comment

rgommers Oct 26, 2024

lucascolley Oct 27, 2024

rgommers Oct 27, 2024

lucascolley Oct 28, 2024

asmeurer Nov 27, 2024

lucascolley Nov 27, 2024

asmeurer Nov 27, 2024

lucascolley Nov 27, 2024

lucascolley commented Dec 13, 2024
