Add 347. Top K Frequent Elements.md by t0hsumi · Pull Request #9 · t0hsumi/leetcode

t0hsumi · 2024-12-11T01:38:10Z

https://leetcode.com/problems/top-k-frequent-elements/description/

hayashi-ay · 2024-12-11T07:16:27Z

347. Top K Frequent Elements.md

+```python
+class Solution:
+    def topKFrequent(self, nums: List[int], k: int) -> List[int]:
+        occurrences: dict[Tuple(int, int)] = {}


I think the type hint is wrong. It should probably be `dict[int, int]`.
https://docs.python.org/3/library/typing.html#generics

Thank you for pointing that out. It is indeed wrong.

I also checked it on my side: https://docs.python.org/3/library/stdtypes.html#types-genericalias:~:text=Another%20example%20for,their%20second%20argument%3A
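
For reference, a minimal sketch of the counting step with the corrected annotation. `count_occurrences` is a hypothetical helper name, not code from this PR, and the built-in generic `dict[int, int]` assumes Python 3.9 or later.

```python
def count_occurrences(nums: list[int]) -> dict[int, int]:
    # dict takes two type parameters: the key type first, then the value type.
    # Here: number -> how many times it appears.
    occurrences: dict[int, int] = {}
    for num in nums:
        if num not in occurrences:
            occurrences[num] = 0
        occurrences[num] += 1
    return occurrences
```

A tuple of two ints would be written `tuple[int, int]`, with square brackets rather than a call.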

thonda28 · 2024-12-11T09:49:45Z

347. Top K Frequent Elements.md

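+I thought the basic flow was: count the frequencies -> sort the frequencies in descending order -> take elements starting from the largest frequency and build the list.
+
+I was unsure how to count the frequencies. The first idea that came to mind was to sort the given `nums`
+and scan it in a loop to count the frequencies. Since the input is named `nums`, I judged that abbreviating `sorted_numbers` to `sorted_nums` was acceptable. A heap could be used both when counting frequencies from the given `nums` and when taking the most frequent elements in order, but the time complexity does not change, and because Python's default heap is a min-heap the entries would have to be `(-1 * number_of_appears, number)`. That felt complicated, so I did not use it.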


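The time complexity does not change, and because Python's default heap is a min-heap the entries would have to be (-1 * number_of_appears, number). That felt complicated, so I did not use it.

I agree that forcing a min-heap to act as a max-heap is complicated. (It does not matter here, since you decided not to use one.) If you did use it, I personally think leaving a comment would be essential.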
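
To make the point concrete, here is a rough sketch of what the negated-key approach might look like, including the kind of comment suggested above. `top_k_frequent_with_heap` is a hypothetical name and this is not the code in the PR.

```python
import heapq
from collections import Counter


def top_k_frequent_with_heap(nums: list[int], k: int) -> list[int]:
    counts = Counter(nums)
    # heapq only provides a min-heap. Storing the NEGATED count makes the
    # most frequent number the smallest entry, so it is popped first.
    heap = [(-count, num) for num, count in counts.items()]
    heapq.heapify(heap)
    return [heapq.heappop(heap)[1] for _ in range(min(k, len(heap)))]
```

For example, `top_k_frequent_with_heap([1, 1, 1, 2, 2, 3], 2)` pops `(-3, 1)` and then `(-2, 2)`.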

thonda28 · 2024-12-11T09:57:32Z

347. Top K Frequent Elements.md

+    if num not in occurrences:
+        occurrences[num] = 1
+    else:
+        occurrences[num] += 1


I personally prefer writing it as follows.

Suggested change

-    if num not in occurrences:
-        occurrences[num] = 1
-    else:
-        occurrences[num] += 1
+    if num not in occurrences:
+        occurrences[num] = 0
+    occurrences[num] += 1
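
As a side-by-side illustration, the suggested form and two other common ways to write the same counting loop are sketched below. The function names are hypothetical; all three are intended to produce the same mapping.

```python
from collections import defaultdict


def count_explicit(nums):
    # The form suggested in the review: initialise to 0, then always add 1.
    occurrences = {}
    for num in nums:
        if num not in occurrences:
            occurrences[num] = 0
        occurrences[num] += 1
    return occurrences


def count_with_get(nums):
    # dict.get supplies 0 for a number that has not been seen yet.
    occurrences = {}
    for num in nums:
        occurrences[num] = occurrences.get(num, 0) + 1
    return occurrences


def count_with_defaultdict(nums):
    # defaultdict(int) creates the missing entry as int() == 0 on first access.
    occurrences = defaultdict(int)
    for num in nums:
        occurrences[num] += 1
    return dict(occurrences)
```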

goto-untrapped · 2024-12-11T14:55:45Z

347. Top K Frequent Elements.md

+            while index < len(sorted_nums) and sorted_nums[index] == number:
+                index += 1
+                number_of_appears += 1
+            frequency.append((number_of_appears, number))


This is just personal preference, but I wanted to try another way of writing the nested while loops here.
The final append happens outside the loop, which is not ideal, but it could also be written like this.

    if len(sorted_nums) == 0:
        return []
    number: int = sorted_nums[0]
    number_of_appears: int = 1
    for i in range(1, len(sorted_nums)):
        if sorted_nums[i] == number:
            number_of_appears += 1
            continue
        frequency.append((number_of_appears, number))
        number = sorted_nums[i]
        # The new number has already been seen once at index i.
        number_of_appears = 1
    frequency.append((number_of_appears, number))

Thank you for the comment.

This version is easier to read because it has less nesting.
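
Another option, not raised in the thread, is `itertools.groupby`, which groups runs of equal adjacent values. A minimal sketch, assuming the input is already sorted; `count_runs` is a hypothetical name.

```python
from itertools import groupby


def count_runs(sorted_nums: list[int]) -> list[tuple[int, int]]:
    # groupby yields one (value, group) pair per run of equal adjacent
    # values, so it only counts correctly when the input is sorted.
    return [
        (sum(1 for _ in group), number)
        for number, group in groupby(sorted_nums)
    ]
```

This handles the empty list without a special case, since `groupby` simply yields nothing.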

oda · 2024-12-11T15:01:39Z

347. Top K Frequent Elements.md

+        occurrences[num] += 1
+    ```
+- When not using a plain dict, use Counter and call `most_common`.
+    https://docs.python.org/3/library/collections.html#collections.Counter


Nice.
While you are at it, take a look at the implementation too. What does it use, and how do you think it compares with yours?
https://github.com/python/cpython/blob/main/Lib/collections/__init__.py

Similarities

Both Counter and defaultdict inherit from the dict class (implemented as a hash table).

https://github.com/python/cpython/blob/main/Objects/dictobject.c

https://github.com/python/cpython/blob/main/Lib/collections/__init__.py#L551

https://github.com/python/cpython/blob/main/Lib/test/test_descrtut.py#L16

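Differences

When a key is missing, Counter returns 0 without storing the key, whereas defaultdict calls the `default_factory` it was given and stores the result. If no factory was given (`default_factory` is None), `collections.defaultdict` raises KeyError, just like a plain dict.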

https://github.com/python/cpython/blob/main/Lib/collections/__init__.py#L616-L619

https://github.com/python/cpython/blob/main/Lib/test/test_descrtut.py#L24-L25

https://docs.python.org/3/library/collections.html#collections.defaultdict.default_factory
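
The missing-key behaviour can be checked directly. A small sketch using only `collections.Counter` and `collections.defaultdict`; the variable names are illustrative.

```python
from collections import Counter, defaultdict

counter = Counter()
missing_count = counter["absent"]        # 0, and the key is NOT stored
counter_has_key = "absent" in counter    # False

with_factory = defaultdict(int)
default_value = with_factory["absent"]       # int() == 0, and the key IS stored
factory_has_key = "absent" in with_factory   # True

without_factory = defaultdict()  # default_factory is None
try:
    without_factory["absent"]
    raised_key_error = False
except KeyError:
    # With no factory, a missing key behaves as in a plain dict.
    raised_key_error = True
```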

Counter's most_common uses heapq.nlargest when n is given; when n is omitted it sorts all items instead.

https://github.com/python/cpython/blob/main/Lib/collections/__init__.py#L642

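heapq.nlargest falls back to the built-in sorted internally when n is at least as large as the input; for smaller n it maintains a heap of n elements.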

https://docs.python.org/3/library/heapq.html#heapq.nlargest

https://github.com/python/cpython/blob/main/Lib/heapq.py#L523
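
To illustrate the relationship, the sketch below calls `most_common` and a roughly equivalent `heapq.nlargest` on the same Counter. The sample data is made up for illustration.

```python
import heapq
from collections import Counter

counts = Counter([1, 1, 1, 2, 2, 3])

# Top two (number, count) pairs, most frequent first.
top_two = counts.most_common(2)

# Roughly what most_common(n) does for a given n: pick the n items
# with the largest count.
same_via_heapq = heapq.nlargest(2, counts.items(), key=lambda item: item[1])

# With no argument, every item is returned in descending count order.
everything_sorted = counts.most_common()

top_two_numbers = [num for num, _ in top_two]
```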

In my defaultdict-based solution, I prepare a separate heap and repeatedly call heapq.heappush and heapq.heappop so that the heap never holds more than the specified number of elements.
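
The approach described above might look roughly like the following. This is a sketch reconstructed from the description, not the code in the PR; `top_k_frequent_bounded_heap` is a hypothetical name.

```python
import heapq
from collections import defaultdict


def top_k_frequent_bounded_heap(nums: list[int], k: int) -> list[int]:
    num_to_count: defaultdict[int, int] = defaultdict(int)
    for num in nums:
        num_to_count[num] += 1

    # Min-heap of (count, number) that never grows beyond k entries.
    heap: list[tuple[int, int]] = []
    for num, count in num_to_count.items():
        heapq.heappush(heap, (count, num))
        if len(heap) > k:
            # Drop the least frequent entry seen so far.
            heapq.heappop(heap)
    # The result is in heap order, not sorted by frequency.
    return [num for _, num in heap]
```

Because the heap holds at most k entries, the selection step costs O(m log k) for m distinct numbers.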

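Thoughts
Since this problem is about counting how often numbers appear, Counter, which is specialised for that, felt easier to read because there is no need to prepare an extra heap.
That said, I also got the impression that the implementation using Counter's built-in method and the implementation where I used a heap myself to find the most frequent values, which defaultdict does not provide, are not doing very different things.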

oda · 2024-12-11T15:02:27Z

347. Top K Frequent Elements.md

+            num_to_count[num] += 1
+
+        if not 0 < k <= len(num_to_count):
+            raise ValueError(


https://docs.python.org/3/library/exceptions.html#ValueError
It is worth browsing the list of defined exceptions from time to time.
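
As an illustration of the check in the diff, a minimal sketch of validating `k` with `ValueError`. The diff is truncated at `raise ValueError(`, so the error message and the function name `top_k_frequent_checked` here are assumptions, not the PR's wording.

```python
from collections import Counter


def top_k_frequent_checked(nums: list[int], k: int) -> list[int]:
    num_to_count = Counter(nums)
    # ValueError fits here: k has the right type but an unacceptable value.
    if not 0 < k <= len(num_to_count):
        raise ValueError(
            f"k must be between 1 and {len(num_to_count)}, got {k}"  # hypothetical message
        )
    return [num for num, _ in num_to_count.most_common(k)]
```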

Add 347. Top K Frequent Elements.md

709aa50


Add step 4

bfc45a8

t0hsumi wants to merge 2 commits into main from 347

5 participants
