Claude review: Re: [PATCH v2] cgroup/dmem: implement dmem.high soft limit via prioritized eviction

public inbox for drm-ai-reviews@public-inbox.freedesktop.org
 help / color / mirror / Atom feed

From: Claude Code Review Bot <claude-review@example.com>
To: dri-devel-reviews@example.com
Subject: Claude review: Re: [PATCH v2] cgroup/dmem: implement dmem.high soft limit via prioritized eviction
Date: Mon, 25 May 2026 19:12:32 +1000	[thread overview]
Message-ID: <review-patch1-f4ce5a78-62a0-4f99-87cf-f5674195f106@gmx.de> (raw)
In-Reply-To: <f4ce5a78-62a0-4f99-87cf-f5674195f106@gmx.de>

Patch Review

**Design concern: `try_high` bypasses `emin`/`elow` protections**

In `dmem_cgroup_state_evict_valuable()`, when `try_high` is true, the function short-circuits before the `emin`/`elow` protection checks:

```c
+	if (try_high)
+		return used > READ_ONCE(ctest->high);
+
 	dmem_cgroup_calculate_protection(limit_pool, test_pool);
-	used = page_counter_read(ctest);
 	min = READ_ONCE(ctest->emin);
 	if (used <= min)
 		return false;
```

This means pass 1 will target a cgroup that is above `high` even if it is within its `emin` guarantee. In the memory cgroup controller, reclaim triggered by `memory.high` still respects `memory.min` protection — `min` is an absolute guarantee. The dmem controller documents `min` the same way (via `page_counter` semantics). If the intent is to make `high` override `min`, that should be explicitly documented and justified; otherwise the `try_high` path should check `emin` too. Consider:

```c
	if (try_high) {
		if (used <= READ_ONCE(ctest->emin))
			return false;
		return used > READ_ONCE(ctest->high);
	}
```

Note that if `high >= min` is always true in practice (which is the expected configuration), this wouldn't change behavior, but it preserves the `min` guarantee contract for correctness.

**Design concern: no `memory.events` style tracking for high limit hits**

The memory cgroup controller tracks `high` limit events via `memory.events`. There is no equivalent tracking here (e.g., counting how many times BOs were evicted due to a cgroup being above its `high` limit). This would be useful for user-space monitoring of whether the `high` limit is having an effect.

**Missing documentation**

There is no update to `Documentation/admin-guide/cgroup-v2.rst` or similar documentation file describing the new `dmem.high` control file, its semantics, default value, and interaction with `min`/`low`/`max`. Kernel cgroup interfaces require documentation for new control knobs.

**Missing selftests**

No selftest or test case is provided. At minimum, a test verifying that BOs from a cgroup over `high` are evicted before BOs from a cgroup under `high` would demonstrate the feature works.

**Typo in commit message**

> Pass 2 falls back to the standard above-elow eviction.

"above-elow" should be "above effective-low" or "above-elow (effective low)".

**Pass 1 early exit catches errors too**

```c
+	evict_walk.try_high = true;
 	lret = ttm_lru_walk_for_evict(&evict_walk.walk, bdev, man, 1);
+	evict_walk.try_high = false;
+	if (lret)
+		goto out;
```

`lret` is negative on error, positive on success. The `if (lret)` check exits on both. If pass 1 encounters an error (e.g., `-ENOSPC` mapped to `-EBUSY` in the callback), this skips the standard passes 2 and 3, which differ in what they're willing to evict. In the original code, an error in the trylock pass would still fall through to `if (lret || !ticket) goto out;`, so this is a minor behavioral change: an error that previously reached the `hit_low` retry no longer does. In practice this is unlikely to matter since errors in LRU walks are typically terminal, but it's worth noting.

**`used` read hoisted above protection calculation — correct but worth noting**

```c
 	ctest = &test_pool->cnt;
+	used = page_counter_read(ctest);
+	...
+	if (try_high)
+		return used > READ_ONCE(ctest->high);
+
 	dmem_cgroup_calculate_protection(limit_pool, test_pool);
-	used = page_counter_read(ctest);
```

Moving `page_counter_read()` above `dmem_cgroup_calculate_protection()` is safe since the read doesn't depend on the protection calculation. Both the `try_high` and non-`try_high` paths now use the same `used` value, which is consistent.

**Cgroupfs file registration looks correct**

The `high` entry is placed between `low` and `max` in the `cftype files[]` array, matching the cgroup v2 convention:

```c
+	{
+		.name = "high",
+		.write = dmem_cgroup_region_high_write,
+		.seq_show = dmem_cgroup_region_high_show,
+		.flags = CFTYPE_NOT_ON_ROOT,
+	},
```

**Initialization is correct**

`page_counter_init()` already sets `high = PAGE_COUNTER_MAX`, and `reset_all_resource_limits()` now also resets it:

```c
+	set_resource_high(rpool, PAGE_COUNTER_MAX);
```

This means existing pools and newly created pools will both default to unlimited, so the feature is opt-in. Correct.

**Stub function for !CONFIG_CGROUP_DMEM is correct**

```c
 bool dmem_cgroup_state_evict_valuable(struct dmem_cgroup_pool_state *limit_pool,
 				      struct dmem_cgroup_pool_state *test_pool,
-				      bool ignore_low, bool *ret_hit_low)
+				      bool try_high, bool ignore_low, bool *ret_hit_low)
 {
 	return true;
 }
```

When dmem cgroup is disabled, the stub ignores `try_high` and returns `true`, so pass 1 behaves identically to pass 2 — first eligible BO gets evicted. This adds one extra (no-op) LRU walk but is functionally correct.

**Summary**: The approach is sound and a significant improvement over v1. The main concerns are: (1) the `try_high` path should respect `emin` protection to maintain the `min` guarantee contract, (2) documentation and selftests are needed, and (3) consider adding event tracking. The code is otherwise clean and correctly uses existing infrastructure.

---
Generated by Claude Code Patch Reviewer

next prev parent reply	other threads:[~2026-05-25  9:12 UTC|newest]

Thread overview: 4+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2026-05-22  8:28 [PATCH v2] cgroup/dmem: implement dmem.high soft limit via prioritized eviction Qiliang Yuan
2026-05-22  9:30 ` Natalie Vock
2026-05-25  9:12   ` Claude Code Review Bot [this message]
2026-05-25  9:12   ` Claude review: " Claude Code Review Bot

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=review-patch1-f4ce5a78-62a0-4f99-87cf-f5674195f106@gmx.de \
    --to=claude-review@example.com \
    --cc=dri-devel-reviews@example.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Be sure your reply has a Subject: header at the top and a blank line before the message body.

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox