On Sat, Sep 30, 2017 at 07:41:56AM +0800, Boqun Feng wrote:
On Fri, Sep 29, 2017 at 04:43:39PM +0000, Paul E. McKenney wrote:
Not to be repetitive, but if the schedule() is on the guest, this change >>> really does silently break up an RCU read-side critical section on
guests built with PREEMPT=n. (Yes, they were already being broken,
but it would be good to avoid this breakage in PREEMPT=n as well as
in PREEMPT=y.)
Then probably adding !IS_ENABLED(CONFIG_PREEMPT) as one of the reason we
choose the halt path? Like:
n.halted = is_idle_task(current) || preempt_count() > 1 ||
!IS_ENABLED(CONFIG_PREEMPT) || rcu_preempt_depth();
But I think async PF could also happen while a user program is running?
Then maybe add a second parameter @user for kvm_async_pf_task_wait(),
like:
kvm_async_pf_task_wait((u32)read_cr2(), user_mode(regs));
and the halt condition becomes:
n.halted = is_idle_task(current) || preempt_count() > 1 ||
(!IS_ENABLED(CONFIG_PREEMPT) && !user) || rcu_preempt_depth();
Thoughts?
This looks to me like it would cover it. If !PREEMPT interrupt from
kernel, we halt, which would prevent the sleep.
I take it that we get unhalted when the host gets things patched up?
A side thing is being broken already for PREEMPT=n means we maybe fail
to detect this in rcutorture? Then should we add a config with
KVM_GUEST=y and try to run some memory consuming things(e.g. stress
--vm) in the rcutorture kvm script simultaneously? Paolo, do you have
any test workload that could trigger async PF quickly?
I do not believe that have seen this in rcutorture, but I always run in
a guest OS on a large-memory system (well, by my old-fashioned standards, anyway) that would be quite unlikely to evict a guest OS's pages. Plus
I tend to run on shared systems, and deliberately running them out of
memory would not be particularly friendly to others using those systems.
I -do- run background scripts that are intended to force the host OS to preempt the guest OSes frequently, but I don't believe that this would
cause that bug.
But it seems like it would make more sense to add this sort of thing to whatever KVM tests there are for host-side eviction of guest pages.
Sysop: | Keyop |
---|---|
Location: | Huddersfield, West Yorkshire, UK |
Users: | 293 |
Nodes: | 16 (2 / 14) |
Uptime: | 212:31:57 |
Calls: | 6,619 |
Calls today: | 1 |
Files: | 12,168 |
Messages: | 5,317,375 |