Spark window functions can be used to compute SQL window functions.
- nth_value(x, offset) [same as input]
Returns the value at the specified offset from the beginning of the window. Offsets start at 1. The offset should be a positive int literal. If the offset is greater than the number of values in the window, null is returned. It is an error for the offset to be zero or negative.
- row_number() integer ¶
Returns a unique, sequential number to each row, starting with one, according to the ordering of rows within the window partition.
- rank() integer ¶
Returns the rank of a value in a group of values. The rank is one plus the number of rows preceding the row that are not peer with the row. Thus, the values in the ordering will produce gaps in the sequence. The ranking is performed for each window partition.
- dense_rank() integer ¶
Returns the rank of a value in a group of values. This is similar to rank(), except that tie values do not produce gaps in the sequence.