Announcement

**darkbasic** · 15 October 2018, 07:02 AM

Do we already have those kind of optimizations in Linux?

**ms178** · 15 October 2018, 07:11 AM

We certainly don't have those kind of optimizations in Windows.

**Guest** · 15 October 2018, 08:53 AM

Man, low level C sometimes makes me want to gouge my eyeballs. From his patch:

Code:

while (vpq->lcnt < lcnt_lo) {
    struct vpgqueues *vptmp;

    iter = (iter + 1) & PQ_L2_MASK;
    vptmp = &vm_page_queues[PQ_FREE + iter];
    if (vptmp->lcnt < lcnt_hi)
        continue;
    m = TAILQ_FIRST(&vptmp->pl);
    KKASSERT(m->queue == PQ_FREE + iter);
    TAILQ_REMOVE(&vptmp->pl, m, pageq);
    --vptmp->lcnt;
    /* queue doesn't change, no need to adj cnt */
    m->queue -= m->pc;
    m->pc = i;
    m->queue += m->pc;
    TAILQ_INSERT_HEAD(&vpq->pl, m, pageq);
    ++vpq->lcnt;
}

**GruenSein** · 15 October 2018, 09:21 AM

Is there any way to measure how much progress a thread is making in its scheduled time? If so, a heuristic algorithm might also be very effective. It could permute the thread layout and determine which one runs best on the "remote" cores without direct memory access. My department is using a similar approach to optimize load balancing problems one HPC machines with nodes of different performance levels. The nice feature of this is approach is that is doesn't actually need to know anything about the nodes. It simply tracks the process and redistributes the workload, if one node isn't performing in sync with the others. A similar approach could potentially be done for the memory domains in NUMA systems.

Announcement

DragonFlyBSD Lands Another NUMA Optimization Helping AMD Threadripper 2 CPUs

DragonFlyBSD Lands Another NUMA Optimization Helping AMD Threadripper 2 CPUs

Comment

Comment

Comment

Comment