Announcement

Collapse
No announcement yet.

RIFS-ES Linux Kernel Scheduler Released

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • #61
    Originally posted by kernelOfTruth View Post
    oh nice !

    a fixed priority preemptive scheduling policy


    I just tried it and the kernel hang during bringing up the CPUs so it must be an issue with the cpu scheduler

    patched RIFS.ES-v1-low-spec-kernel3.4.x first and then replaced rifs.c with the one from "DMS-3.4.x V1(patch your kernel with RIFS-ES first)"


    any ideas ?


    thanks !
    Could I get the calltrace? Thanks.

    Comment


    • #62
      Originally posted by 3766691 View Post
      Could I get the calltrace? Thanks.
      unfortunately it doesn't say anything useful / nothing at all:

      if I remember correctly it hangs at:

      [ 0.000000] KERNEL supported cpus:
      [ 0.000000] Intel GenuineIntel
      [ 0.000000] AMD AuthenticAMD
      [ 0.000000] BIOS-provided physical RAM map:
      [ 0.000000] BIOS-e820: 0000000000000000 - 000000000009dc00 (usable)
      [ 0.000000] BIOS-e820: 000000000009dc00 - 00000000000a0000 (reserved)
      [ 0.000000] BIOS-e820: 00000000000e0000 - 0000000000100000 (reserved)
      [ 0.000000] BIOS-e820: 0000000000100000 - 00000000bf780000 (usable)
      [ 0.000000] BIOS-e820: 00000000bf780000 - 00000000bf78e000 (ACPI data)
      [ 0.000000] BIOS-e820: 00000000bf78e000 - 00000000bf7d0000 (ACPI NVS)
      [ 0.000000] BIOS-e820: 00000000bf7d0000 - 00000000bf7e0000 (reserved)
      [ 0.000000] BIOS-e820: 00000000bf7ed000 - 00000000c0000000 (reserved)
      [ 0.000000] BIOS-e820: 00000000fee00000 - 00000000fee01000 (reserved)
      [ 0.000000] BIOS-e820: 00000000ffb00000 - 0000000100000000 (reserved)
      [ 0.000000] BIOS-e820: 0000000100000000 - 0000000240000000 (usable)
      [ 0.000000] NX (Execute Disable) protection: active
      [ 0.000000] DMI present.
      [ 0.000000] DMI: Packard Bell ipower G3710/FMP55, BIOS P01-B2C0 03/29/2010
      [ 0.000000] e820 update range: 0000000000000000 - 0000000000010000 (usable) ==> (reserved)
      [ 0.000000] e820 remove range: 00000000000a0000 - 0000000000100000 (usable)
      [ 0.000000] No AGP bridge found
      [ 0.000000] last_pfn = 0x240000 max_arch_pfn = 0x400000000
      [ 0.000000] MTRR default type: uncachable
      [ 0.000000] MTRR fixed ranges enabled:
      [ 0.000000] 00000-9FFFF write-back
      [ 0.000000] A0000-BFFFF uncachable
      [ 0.000000] C0000-D3FFF write-protect
      [ 0.000000] D4000-DFFFF uncachable
      [ 0.000000] E0000-E3FFF write-protect
      [ 0.000000] E4000-E7FFF write-through
      [ 0.000000] E8000-EBFFF write-protect
      [ 0.000000] EC000-EFFFF write-through
      [ 0.000000] F0000-FFFFF write-protect
      [ 0.000000] MTRR variable ranges enabled:
      [ 0.000000] 0 base 000000000 mask E00000000 write-back
      [ 0.000000] 1 base 200000000 mask FC0000000 write-back
      [ 0.000000] 2 base 0C0000000 mask FC0000000 uncachable
      [ 0.000000] 3 disabled
      [ 0.000000] 4 disabled
      [ 0.000000] 5 disabled
      [ 0.000000] 6 disabled
      [ 0.000000] 7 disabled
      [ 0.000000] x86 PAT enabled: cpu 0, old 0x7040600070406, new 0x7010600070106
      [ 0.000000] original variable MTRRs
      [ 0.000000] reg 0, base: 0GB, range: 8GB, type WB
      [ 0.000000] reg 1, base: 8GB, range: 1GB, type WB
      [ 0.000000] reg 2, base: 3GB, range: 1GB, type UC
      [ 0.000000] total RAM covered: 8192M
      [ 0.000000] Found optimal setting for mtrr clean up
      [ 0.000000] gran_size: 64K chunk_size: 64K num_reg: 4 lose cover RAM: 0G
      [ 0.000000] New variable MTRRs
      [ 0.000000] reg 0, base: 0GB, range: 2GB, type WB
      [ 0.000000] reg 1, base: 2GB, range: 1GB, type WB
      [ 0.000000] reg 2, base: 4GB, range: 4GB, type WB
      [ 0.000000] reg 3, base: 8GB, range: 1GB, type WB
      [ 0.000000] e820 update range: 00000000c0000000 - 0000000100000000 (usable) ==> (reserved)
      [ 0.000000] last_pfn = 0xbf780 max_arch_pfn = 0x400000000
      [ 0.000000] found SMP MP-table at [ffff8800000ff780] ff780
      [ 0.000000] initial memory mapped : 0 - 20000000
      [ 0.000000] Base memory trampoline at [ffff880000098000] 98000 size 20480
      [ 0.000000] init_memory_mapping: 0000000000000000-00000000bf780000
      [ 0.000000] 0000000000 - 00bf600000 page 2M
      [ 0.000000] 00bf600000 - 00bf780000 page 4k
      [ 0.000000] kernel direct mapping tables up to bf780000 @ 1fffb000-20000000
      [ 0.000000] init_memory_mapping: 0000000100000000-0000000240000000
      [ 0.000000] 0100000000 - 0240000000 page 2M
      [ 0.000000] kernel direct mapping tables up to 240000000 @ bf776000-bf780000
      [ 0.000000] RAMDISK: 37db5000 - 37ff0000
      [ 0.000000] ACPI: RSDP 00000000000f9cf0 00024 (v02 ACPIAM)
      [ 0.000000] ACPI: XSDT 00000000bf780100 0006C (v01 ACRSYS ACRPRDCT 20100329 MSFT 00000097)
      [ 0.000000] ACPI: FACP 00000000bf780290 000F4 (v04 ACRSYS FACP1137 20100329 MSFT 00000097)
      [ 0.000000] ACPI: DSDT 00000000bf7805e0 07E42 (v02 926A1 926A1P15 00000000 INTL 20051117)
      [ 0.000000] ACPI: FACS 00000000bf78e000 00040
      [ 0.000000] ACPI: APIC 00000000bf780390 0008C (v02 ACRSYS APIC1137 20100329 MSFT 00000097)
      [ 0.000000] ACPI: MCFG 00000000bf780420 0003C (v01 ACRSYS OEMMCFG 20100329 MSFT 00000097)
      [ 0.000000] ACPI: SLIC 00000000bf780460 00176 (v01 ACRSYS ACRPRDCT 20100329 MSFT 00000097)
      [ 0.000000] ACPI: OEMB 00000000bf78e040 00072 (v01 ACRSYS OEMB1137 20100329 MSFT 00000097)
      [ 0.000000] ACPI: HPET 00000000bf78a5e0 00038 (v01 ACRSYS OEMHPET 20100329 MSFT 00000097)
      [ 0.000000] ACPI: GSCI 00000000bf78e0c0 02024 (v01 ACRSYS GMCHSCI 20100329 MSFT 00000097)
      [ 0.000000] ACPI: AWMI 00000000bf7900f0 0004E (v01 ACRSYS OEMB1137 20100329 MSFT 00000097)
      [ 0.000000] ACPI: SSDT 00000000bf792c10 00363 (v01 DpgPmm CpuPm 00000012 INTL 20051117)
      [ 0.000000] ACPI: Local APIC address 0xfee00000
      [ 0.000000] [ffffea0000000000-ffffea0008ffffff] PMD -> [ffff880237600000-ffff88023f5fffff] on node 0
      [ 0.000000] Zone PFN ranges:
      [ 0.000000] DMA 0x00000010 -> 0x00001000
      [ 0.000000] DMA32 0x00001000 -> 0x00100000
      [ 0.000000] Normal 0x00100000 -> 0x00240000
      [ 0.000000] Movable zone start PFN for each node
      [ 0.000000] Early memory PFN ranges
      [ 0.000000] 0: 0x00000010 -> 0x0000009d
      [ 0.000000] 0: 0x00000100 -> 0x000bf780
      [ 0.000000] 0: 0x00100000 -> 0x00240000
      [ 0.000000] On node 0 totalpages: 2094861
      [ 0.000000] DMA zone: 64 pages used for memmap
      [ 0.000000] DMA zone: 5 pages reserved
      [ 0.000000] DMA zone: 3912 pages, LIFO batch:0
      [ 0.000000] DMA32 zone: 16320 pages used for memmap
      [ 0.000000] DMA32 zone: 763840 pages, LIFO batch:31
      [ 0.000000] Normal zone: 20480 pages used for memmap
      [ 0.000000] Normal zone: 1290240 pages, LIFO batch:31
      [ 0.000000] ACPI: PM-Timer IO Port: 0x808
      [ 0.000000] ACPI: Local APIC address 0xfee00000
      [ 0.000000] ACPI: LAPIC (acpi_id[0x01] lapic_id[0x00] enabled)
      [ 0.000000] ACPI: LAPIC (acpi_id[0x02] lapic_id[0x02] enabled)
      [ 0.000000] ACPI: LAPIC (acpi_id[0x03] lapic_id[0x04] enabled)
      [ 0.000000] ACPI: LAPIC (acpi_id[0x04] lapic_id[0x06] enabled)
      [ 0.000000] ACPI: LAPIC (acpi_id[0x05] lapic_id[0x01] enabled)
      [ 0.000000] ACPI: LAPIC (acpi_id[0x06] lapic_id[0x03] enabled)
      [ 0.000000] ACPI: LAPIC (acpi_id[0x07] lapic_id[0x05] enabled)
      [ 0.000000] ACPI: LAPIC (acpi_id[0x08] lapic_id[0x07] enabled)
      [ 0.000000] ACPI: IOAPIC (id[0x08] address[0xfec00000] gsi_base[0])
      [ 0.000000] IOAPIC[0]: apic_id 8, version 32, address 0xfec00000, GSI 0-23
      [ 0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 0 global_irq 2 dfl dfl)
      [ 0.000000] ACPI: INT_SRC_OVR (bus 0 bus_irq 9 global_irq 9 high level)
      [ 0.000000] ACPI: IRQ0 used by override.
      [ 0.000000] ACPI: IRQ2 used by override.
      [ 0.000000] ACPI: IRQ9 used by override.
      [ 0.000000] Using ACPI (MADT) for SMP configuration information
      [ 0.000000] ACPI: HPET id: 0xffffffff base: 0xfed00000
      [ 0.000000] SMP: Allowing 8 CPUs, 0 hotplug CPUs
      [ 0.000000] nr_irqs_gsi: 40
      [ 0.000000] PM: Registered nosave memory: 000000000009d000 - 000000000009e000
      [ 0.000000] PM: Registered nosave memory: 000000000009e000 - 00000000000a0000
      [ 0.000000] PM: Registered nosave memory: 00000000000a0000 - 00000000000e0000
      [ 0.000000] PM: Registered nosave memory: 00000000000e0000 - 0000000000100000
      [ 0.000000] PM: Registered nosave memory: 00000000bf780000 - 00000000bf78e000
      [ 0.000000] PM: Registered nosave memory: 00000000bf78e000 - 00000000bf7d0000
      [ 0.000000] PM: Registered nosave memory: 00000000bf7d0000 - 00000000bf7e0000
      [ 0.000000] PM: Registered nosave memory: 00000000bf7e0000 - 00000000bf7ed000
      [ 0.000000] PM: Registered nosave memory: 00000000bf7ed000 - 00000000c0000000
      [ 0.000000] PM: Registered nosave memory: 00000000c0000000 - 00000000fee00000
      [ 0.000000] PM: Registered nosave memory: 00000000fee00000 - 00000000fee01000
      [ 0.000000] PM: Registered nosave memory: 00000000fee01000 - 00000000ffb00000
      [ 0.000000] PM: Registered nosave memory: 00000000ffb00000 - 0000000100000000
      [ 0.000000] Allocating PCI resources starting at c0000000 (gap: c0000000:3ee00000)
      [ 0.000000] setup_percpu: NR_CPUS:16 nr_cpumask_bits:16 nr_cpu_ids:8 nr_node_ids:1
      [ 0.000000] PERCPU: Embedded 25 pages/cpu @ffff88023fc00000 s71424 r8192 d22784 u262144
      [ 0.000000] pcpu-alloc: s71424 r8192 d22784 u262144 alloc=1*2097152
      [ 0.000000] pcpu-alloc: [0] 0 1 2 3 4 5 6 7
      [ 0.000000] Built 1 zonelists in Zone order, mobility grouping on. Total pages: 2057992
      [ 0.000000] Kernel command line: dolvm root=/dev/ram0 init=/linuxrc ramdisk=8192 crypt_root=/dev/sdc6 real_root=/dev/mapper/GENTOO-ROOT noresume noresume2 rootflags=data=journal udev ro elevator=cfq snd-hda-intel.enable_msi=1 fbcon=scrollback:256K pax_softmode=1 raid=noautodetect pcie_aspm=force usbcore.autosuspend=1
      [ 0.000000] PCIe ASPM is forcibly enabled
      [ 0.000000] PID hash table entries: 4096 (order: 3, 32768 bytes)
      [ 0.000000] Dentry cache hash table entries: 1048576 (order: 11, 8388608 bytes)
      [ 0.000000] Inode-cache hash table entries: 524288 (order: 10, 4194304 bytes)
      [ 0.000000] Checking aperture...
      [ 0.000000] No AGP bridge found
      [ 0.000000] Calgary: detecting Calgary via BIOS EBDA area
      [ 0.000000] Calgary: Unable to locate Rio Grande table in EBDA - bailing!
      [ 0.000000] Memory: 8152472k/9437184k available (7282k kernel code, 1057740k absent, 226972k reserved, 5631k data, 560k init)
      [ 0.000000] SLUB: Genslabs=15, HWalign=64, Order=0-3, MinObjects=0, CPUs=8, Nodes=1
      [ 0.000000] Rotary Interactivity Favor Scheduler - RIFS By QQ:3766691.
      [ 0.000000] Preemptible hierarchical RCU implementation.
      [ 0.000000] Dump stacks of tasks blocking RCU-preempt GP.
      [ 0.000000] NR_IRQS:4352 nr_irqs:808 16
      [ 0.000000] Extended CMOS year: 2000
      [ 0.000000] Console: colour VGA+ 80x25
      [ 0.000000] console [tty0] enabled
      [ 0.000000] hpet clockevent registered
      [ 0.000000] Fast TSC calibration using PIT
      [ 0.003333] Detected 2792.871 MHz processor.
      [ 0.000002] Calibrating delay loop (skipped), value calculated using timer frequency.. 5587.48 BogoMIPS (lpj=9309570)
      [ 0.000164] pid_max: default: 32768 minimum: 301
      [ 0.000274] Mount-cache hash table entries: 256
      [ 0.000490] Initializing cgroup subsys freezer
      [ 0.000570] Initializing cgroup subsys bfqio
      [ 0.000675] CPU: Physical Processor ID: 0
      [ 0.000753] CPU: Processor Core ID: 0
      [ 0.000833] mce: CPU supports 9 MCE banks
      [ 0.000921] CPU0: Thermal monitoring enabled (TM1)
      [ 0.001007] using mwait in idle threads.
      [ 0.001133] ACPI: Core revision 20120320
      [ 0.026359] ..TIMER: vector=0x30 apic1=0 pin1=2 apic2=-1 pin2=-1
      [ 0.059387] CPU0: Intel(R) Core(TM) i7 CPU 860 @ 2.80GHz stepping 05
      [ 0.163672] Performance Events: PEBS fmt1+, 16-deep LBR, Nehalem events, Intel PMU driver.
      [ 0.163945] CPU erratum AAJ80 worked around
      [ 0.164024] CPUID marked event: 'bus cycles' unavailable
      [ 0.164106] ... version: 3
      [ 0.164184] ... bit width: 48
      [ 0.164262] ... generic registers: 4
      [ 0.164340] ... value mask: 0000ffffffffffff
      [ 0.164421] ... max period: 000000007fffffff
      [ 0.164501] ... fixed-purpose events: 3
      [ 0.164580] ... event mask: 000000070000000f
      [ 0.164818] NMI watchdog: enabled, takes one hw-pmu counter.
      [ 0.170336] Booting Node 0, Processors #1
      [ 0.183627] NMI watchdog: enabled, takes one hw-pmu counter.
      [ 0.203610] #2
      [ 0.216862] NMI watchdog: enabled, takes one hw-pmu counter.
      [ 0.243523] #3
      [ 0.256768] NMI watchdog: enabled, takes one hw-pmu counter.
      [ 0.270162] #4
      [ 0.283406] NMI watchdog: enabled, takes one hw-pmu counter.
      [ 0.303417] #5
      [ 0.316561] NMI watchdog: enabled, takes one hw-pmu counter.
      [ 0.336542] #6
      [ 0.349789] NMI watchdog: enabled, takes one hw-pmu counter.
      [ 0.369802] #7
      [ 0.383045] NMI watchdog: enabled, takes one hw-pmu counter.
      [ 0.383224] Brought up 8 CPUs
      [ 0.383304] Total of 8 processors activated (44703.85 BogoMIPS).
      <--

      exactly there


      after that normally it would continue with:

      [ 0.388147] devtmpfs: initialized
      [ 0.388453] PM: Registering ACPI NVS region [mem 0xbf78e000-0xbf7cffff] (270336 bytes)
      [ 0.388589] xor: automatically using best checksumming function: generic_sse
      [ 0.403247] generic_sse: 9853.200 MB/sec
      [ 0.403325] xor: using function: generic_sse (9853.200 MB/sec)
      [ 0.403431] NET: Registered protocol family 16
      [ 0.453269] ACPI FADT declares the system doesn't support PCIe ASPM, so disable it
      [ 0.453387] ACPI: bus type pci registered
      [ 0.453531] dca service started, version 1.12.1
      [ 0.453630] PCI: MMCONFIG for domain 0000 [bus 00-ff] at [mem 0xe0000000-0xefffffff] (base 0xe0000000)
      [ 0.454596] PCI: not using MMCONFIG
      [ 0.454672] PCI: Using configuration type 1 for base access
      [ 0.458933] bio: create slab <bio-0> at 0
      [ 0.513238] raid6: int64x1 2569 MB/s
      [ 0.569711] raid6: int64x2 3198 MB/s
      [ 0.626211] raid6: int64x4 2210 MB/s
      [ 0.682780] raid6: int64x8 2116 MB/s
      [ 0.739336] raid6: sse2x1 6970 MB/s
      [ 0.795903] raid6: sse2x2 8109 MB/s
      [ 0.852466] raid6: sse2x4 8980 MB/s
      [ 0.852543] raid6: using algorithm sse2x4 (8980 MB/s)
      [ 0.852651] ACPI: Added _OSI(Module Device)
      [ 0.852731] ACPI: Added _OSI(Processor Device)
      [ 0.852809] ACPI: Added _OSI(3.0 _SCP Extensions)
      [ 0.852888] ACPI: Added _OSI(Processor Aggregator Device)
      [ 0.854053] ACPI: EC: Look up EC in DSDT
      [ 0.855642] ACPI: Executed 1 blocks of module-level executable AML code
      [ 0.869165] ACPI: SSDT 00000000bf790140 0244C (v01 DpgPmm P001Ist 00000011 INTL 20051117)
      [ 0.869696] ACPI: Dynamic OEM Table Load:
      [ 0.869848] ACPI: SSDT (null) 0244C (v01 DpgPmm P001Ist 00000011 INTL 20051117)
      [ 0.870110] ACPI: SSDT 00000000bf792590 00678 (v01 PmRef P001Cst 00003001 INTL 20051117)
      [ 0.870594] ACPI: Dynamic OEM Table Load:
      [ 0.870747] ACPI: SSDT (null) 00678 (v01 PmRef P001Cst 00003001 INTL 20051117)
      [ 0.871504] ACPI: Interpreter enabled
      [ 0.871583] ACPI: (supports S0 S3 S4 S5)
      [ 0.871821] ACPI: Using IOAPIC for interrupt routing
      [ 0.871915] PCI: MMCONFIG for domain 0000 [bus 00-ff] at [mem 0xe0000000-0xefffffff] (base 0xe0000000)
      [ 0.873750] PCI: MMCONFIG at [mem 0xe0000000-0xefffffff] reserved in ACPI motherboard resources
      [ 0.906676] ACPI: No dock devices found.
      [ 0.906756] PCI: Using host bridge windows from ACPI; if necessary, use "pci=nocrs" and report a bug
      [ 0.906999] ACPI: PCI Root Bridge [PCI0] (domain 0000 [bus 00-ff])

      not sure how that would help ...



      edit:

      I'll take another look and see if I can spot anything useful


      edit2:

      yeah, nothing more

      on the screen it even only shows 12-20 lines (KMS isn't compiled in since loading the module during bootup locked up the computer for me - so the screen resolution is pretty low during startup until the radeon kernel module is loaded)
      Last edited by kernelOfTruth; 01 July 2012, 11:42 AM.

      Comment


      • #63
        Originally posted by kernelOfTruth View Post
        unfortunately it doesn't say anything useful / nothing at all:

        if I remember correctly it hangs at:

        <--

        exactly there


        after that normally it would continue with:




        not sure how that would help ...



        edit:

        I'll take another look and see if I can spot anything useful


        edit2:

        yeah, nothing more

        on the screen it even only shows 12-20 lines (KMS isn't compiled in since loading the module during bootup locked up the computer for me - so the screen resolution is pretty low during startup until the radeon kernel module is loaded)
        You means that the kernel suddenly hang?

        Comment


        • #64
          Originally posted by 3766691 View Post
          You means that the kernel suddenly hang?
          yes, it just hangs there


          all usb stuff is off (I have an usb keyboard)



          that kernel is an heavy patched kernel (3.4.2-geek) with additional btrfs updates + RIFS

          this could be the reason but I doubt it


          I'll give vanilla 3.4.2 + btrfs changes + RIFS a try and see whether that makes a difference


          edit:

          got the trace:


          cpu_stopper_thread
          __switch_to
          get_parent_ip
          sub_preempt_count
          schedule
          stop_machine_cpu_stop
          kthread
          kernel_thread_helper
          flush_kthread_worker
          gs_change



          don't know if that is identical to the first trace but it seems to loop so that message comes again and again

          hope that helps
          Last edited by kernelOfTruth; 01 July 2012, 12:25 PM.

          Comment


          • #65
            Chen, I had to wait 1-2 minutes until the messages appeared until then it

            stayed there at "Total of 8 processors activated (44703.85 BogoMIPS)."


            and it was the same for both - the fully patched and the slightly patched vanilla 3.4.2 kernel



            I'd like to mostly avoid further tests and boots where it hangs since this stupid computer doesn't have a reset button - putting some stress on the harddrives

            hope you'll figure out what the cause is (could it be changes from 3.4.x to 3.5.x ?)




            looking forward to the DMS variant


            thanks !

            Comment


            • #66
              Originally posted by kernelOfTruth View Post
              Chen, I had to wait 1-2 minutes until the messages appeared until then it

              stayed there at "Total of 8 processors activated (44703.85 BogoMIPS)."


              and it was the same for both - the fully patched and the slightly patched vanilla 3.4.2 kernel



              I'd like to mostly avoid further tests and boots where it hangs since this stupid computer doesn't have a reset button - putting some stress on the harddrives

              hope you'll figure out what the cause is (could it be changes from 3.4.x to 3.5.x ?)




              looking forward to the DMS variant


              thanks !
              Hi

              Try BFS-O1 instead. Normally there wont be any differeny feeling bewteen BFS and BFS-O(1)
              If the hang happened that means there are explicting bugs.
              All the computer in my house can run both of them.
              Chen

              Comment


              • #67
                Originally posted by 3766691 View Post
                Hi

                Try BFS-O1 instead. Normally there wont be any differeny feeling bewteen BFS and BFS-O(1)
                If the hang happened that means there are explicting bugs.
                All the computer in my house can run both of them.
                Chen
                The reason why I want you to try is because DMS and BFS-O1 use the same method to implement the proportional fair sched in O(1) time complexity.

                Comment


                • #68
                  @KernelOfTruth
                  The bug may be fixed with the new updated one. I have just got the same problem when I tune the scheduler.Then I have rewrite a piece of code and I can boot my computer again.

                  Comment


                  • #69
                    Originally posted by 3766691 View Post
                    The reason why I want you to try is because DMS and BFS-O1 use the same method to implement the proportional fair sched in O(1) time complexity.
                    thanks for the clarification and sorry for the delay

                    I'm currently using BFS 424 (the diff of your changes applied cleanly on top of 424) and the system looks pretty snappy

                    probably will do some tests within the next days


                    Originally posted by 3766691 View Post
                    @KernelOfTruth
                    The bug may be fixed with the new updated one. I have just got the same problem when I tune the scheduler.Then I have rewrite a piece of code and I can boot my computer again.
                    unfortunately there's still the same issue / error message with latest rifs changes


                    edit:

                    for those that want to try it on top of 424 or 423:

                    Pastebin.com is the number one paste tool since 2002. Pastebin is a website where you can store text online for a set period of time.
                    Last edited by kernelOfTruth; 02 July 2012, 04:35 PM.

                    Comment


                    • #70
                      lol - it seems that BFS + your tweaks challenge btrfs and the kernel quite a lot

                      Jul 2 22:34:06 lupus kernel: [ 599.734807] INFO: task btrfs-transacti:10787 blocked for more than 120 seconds.
                      Jul 2 22:34:06 lupus kernel: [ 599.734812] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
                      Jul 2 22:34:06 lupus kernel: [ 599.734816] btrfs-transacti D ffff88023fc113c0 0 10787 2 0x00000000
                      Jul 2 22:34:06 lupus kernel: [ 599.734823] ffff880236b10c00 0000000000000046 ffff8801eae324b8 ffff8801bd059e00
                      Jul 2 22:34:06 lupus kernel: [ 599.734832] ffff8801a0a4bfd8 ffff8801a0a4bfd8 ffff8801a0a4bfd8 ffff8801bd059e00
                      Jul 2 22:34:06 lupus kernel: [ 599.734838] 0000000000000001 ffff8801817784b0 ffff8801c9a1fc00 0000000000000000
                      Jul 2 22:34:06 lupus kernel: [ 599.734845] Call Trace:
                      Jul 2 22:34:06 lupus kernel: [ 599.734856] [<ffffffff81716aa2>] ? schedule_timeout+0x192/0x210
                      Jul 2 22:34:06 lupus kernel: [ 599.734864] [<ffffffff8107e389>] ? get_parent_ip+0x9/0x20
                      Jul 2 22:34:06 lupus kernel: [ 599.734869] [<ffffffff8107e389>] ? get_parent_ip+0x9/0x20
                      Jul 2 22:34:06 lupus kernel: [ 599.734874] [<ffffffff8107e389>] ? get_parent_ip+0x9/0x20
                      Jul 2 22:34:06 lupus kernel: [ 599.734879] [<ffffffff8107e3f7>] ? sub_preempt_count+0x57/0xb0
                      Jul 2 22:34:06 lupus kernel: [ 599.734886] [<ffffffff81719573>] ? _raw_spin_unlock_irqrestore+0x13/0x40
                      Jul 2 22:34:06 lupus kernel: [ 599.734894] [<ffffffff81309011>] ? btrfs_commit_transaction+0x341/0xa00
                      Jul 2 22:34:06 lupus kernel: [ 599.734901] [<ffffffff81073870>] ? finish_wait+0x90/0x90
                      Jul 2 22:34:06 lupus kernel: [ 599.734906] [<ffffffff81309b35>] ? start_transaction+0x95/0x2c0
                      Jul 2 22:34:06 lupus kernel: [ 599.734911] [<ffffffff81302b05>] ? transaction_kthread+0x265/0x2f0
                      Jul 2 22:34:06 lupus kernel: [ 599.734917] [<ffffffff813028a0>] ? btree_lock_page_hook+0x1b0/0x1b0
                      Jul 2 22:34:06 lupus kernel: [ 599.734922] [<ffffffff81073575>] ? kthread+0x85/0x90
                      Jul 2 22:34:06 lupus kernel: [ 599.734929] [<ffffffff8171b594>] ? kernel_thread_helper+0x4/0x10
                      Jul 2 22:34:06 lupus kernel: [ 599.734934] [<ffffffff810734f0>] ? flush_kthread_worker+0x80/0x80
                      Jul 2 22:34:06 lupus kernel: [ 599.734940] [<ffffffff8171b590>] ? gs_change+0xb/0xb
                      Jul 2 22:34:06 lupus kernel: [ 599.734947] INFO: task rsync:10834 blocked for more than 120 seconds.
                      Jul 2 22:34:06 lupus kernel: [ 599.734950] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
                      Jul 2 22:34:06 lupus kernel: [ 599.734953] rsync D ffff88023fc513c0 0 10834 10833 0x00000000
                      Jul 2 22:34:06 lupus kernel: [ 599.734960] ffff8802368b5a00 0000000000000082 0000000000000001 ffff88023536e600
                      Jul 2 22:34:06 lupus kernel: [ 599.734967] ffff880183e63fd8 ffff880183e63fd8 ffff880183e63fd8 ffff88023536e600
                      Jul 2 22:34:06 lupus kernel: [ 599.734973] ffff8801c9a1b620 ffff8801c9a1b400 ffff8801c9a1b620 ffff880183e63ed8
                      Jul 2 22:34:06 lupus kernel: [ 599.734981] Call Trace:
                      Jul 2 22:34:06 lupus kernel: [ 599.734986] [<ffffffff813009d3>] ? btrfs_find_free_ino+0xb3/0x140
                      Jul 2 22:34:06 lupus kernel: [ 599.734992] [<ffffffff81073870>] ? finish_wait+0x90/0x90
                      Jul 2 22:34:06 lupus kernel: [ 599.734999] [<ffffffff8131600d>] ? btrfs_mkdir+0x6d/0x1f0
                      Jul 2 22:34:06 lupus kernel: [ 599.735006] [<ffffffff81134583>] ? vfs_mkdir+0x93/0x140
                      Jul 2 22:34:06 lupus kernel: [ 599.735011] [<ffffffff811387be>] ? sys_mkdirat+0xbe/0xd0
                      Jul 2 22:34:06 lupus kernel: [ 599.735017] [<ffffffff8171a266>] ? system_call_fastpath+0x1a/0x1f
                      Jul 2 22:34:06 lupus kernel: [ 599.735021] INFO: task rsync:10835 blocked for more than 120 seconds.
                      Jul 2 22:34:06 lupus kernel: [ 599.735025] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
                      Jul 2 22:34:06 lupus kernel: [ 599.735028] rsync D ffff88023fc913c0 0 10835 10834 0x00000000
                      Jul 2 22:34:06 lupus kernel: [ 599.735035] ffff880236998000 0000000000000086 0000000000000001 ffff8801bd058000
                      Jul 2 22:34:06 lupus kernel: [ 599.735042] ffff880183ec3fd8 ffff880183ec3fd8 ffff880183ec3fd8 ffff8801bd058000
                      Jul 2 22:34:06 lupus kernel: [ 599.735048] ffff8801f7afc230 ffff88021e9c4128 ffff8801c9a1b520 0000000000000000
                      Jul 2 22:34:06 lupus kernel: [ 599.735055] Call Trace:
                      Jul 2 22:34:06 lupus kernel: [ 599.735060] [<ffffffff81308695>] ? wait_current_trans.isra.30+0xa5/0xf0
                      Jul 2 22:34:06 lupus kernel: [ 599.735065] [<ffffffff81073870>] ? finish_wait+0x90/0x90
                      Jul 2 22:34:06 lupus kernel: [ 599.735070] [<ffffffff81309be5>] ? start_transaction+0x145/0x2c0
                      Jul 2 22:34:06 lupus kernel: [ 599.735076] [<ffffffff8107e389>] ? get_parent_ip+0x9/0x20
                      Jul 2 22:34:06 lupus kernel: [ 599.735082] [<ffffffff81315597>] ? btrfs_rename+0x137/0x6e0
                      Jul 2 22:34:06 lupus kernel: [ 599.735087] [<ffffffff81136735>] ? path_lookupat+0x65/0x6d0
                      Jul 2 22:34:06 lupus kernel: [ 599.735094] [<ffffffff81061445>] ? ns_capable+0x25/0x50
                      Jul 2 22:34:06 lupus kernel: [ 599.735100] [<ffffffff811352d6>] ? vfs_rename+0x126/0x4f0
                      Jul 2 22:34:06 lupus kernel: [ 599.735105] [<ffffffff811401ce>] ? d_lookup+0x2e/0x60
                      Jul 2 22:34:06 lupus kernel: [ 599.735110] [<ffffffff8113313c>] ? __lookup_hash+0x2c/0x120
                      Jul 2 22:34:06 lupus kernel: [ 599.735115] [<ffffffff81138cba>] ? sys_renameat+0x21a/0x240
                      Jul 2 22:34:06 lupus kernel: [ 599.735121] [<ffffffff81143745>] ? notify_change+0x2c5/0x350
                      Jul 2 22:34:06 lupus kernel: [ 599.735127] [<ffffffff8107e389>] ? get_parent_ip+0x9/0x20
                      Jul 2 22:34:06 lupus kernel: [ 599.735132] [<ffffffff8107e3f7>] ? sub_preempt_count+0x57/0xb0
                      Jul 2 22:34:06 lupus kernel: [ 599.735138] [<ffffffff8107e4cf>] ? add_preempt_count+0x7f/0xb0
                      Jul 2 22:34:06 lupus kernel: [ 599.735143] [<ffffffff8107e389>] ? get_parent_ip+0x9/0x20
                      Jul 2 22:34:06 lupus kernel: [ 599.735148] [<ffffffff8107e389>] ? get_parent_ip+0x9/0x20
                      Jul 2 22:34:06 lupus kernel: [ 599.735153] [<ffffffff8107e3f7>] ? sub_preempt_count+0x57/0xb0
                      Jul 2 22:34:06 lupus kernel: [ 599.735158] [<ffffffff81144d34>] ? vfsmount_lock_local_unlock+0x24/0x40
                      Jul 2 22:34:06 lupus kernel: [ 599.735164] [<ffffffff8171a266>] ? system_call_fastpath+0x1a/0x1f
                      Jul 2 22:36:06 lupus kernel: [ 719.528311] INFO: task btrfs-transacti:10787 blocked for more than 120 seconds.
                      Jul 2 22:36:06 lupus kernel: [ 719.528315] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
                      Jul 2 22:36:06 lupus kernel: [ 719.528320] btrfs-transacti D ffff88023fc113c0 0 10787 2 0x00000000
                      Jul 2 22:36:06 lupus kernel: [ 719.528327] ffff880236b10c00 0000000000000046 ffff8801eae324b8 ffff8801bd059e00
                      Jul 2 22:36:06 lupus kernel: [ 719.528334] ffff8801a0a4bfd8 ffff8801a0a4bfd8 ffff8801a0a4bfd8 ffff8801bd059e00
                      Jul 2 22:36:06 lupus kernel: [ 719.528341] 0000000000000001 ffff8801817784b0 ffff8801c9a1fc00 0000000000000000
                      Jul 2 22:36:06 lupus kernel: [ 719.528348] Call Trace:
                      Jul 2 22:36:06 lupus kernel: [ 719.528358] [<ffffffff81716aa2>] ? schedule_timeout+0x192/0x210
                      Jul 2 22:36:06 lupus kernel: [ 719.528366] [<ffffffff8107e389>] ? get_parent_ip+0x9/0x20
                      Jul 2 22:36:06 lupus kernel: [ 719.528371] [<ffffffff8107e389>] ? get_parent_ip+0x9/0x20
                      Jul 2 22:36:06 lupus kernel: [ 719.528376] [<ffffffff8107e389>] ? get_parent_ip+0x9/0x20
                      Jul 2 22:36:06 lupus kernel: [ 719.528382] [<ffffffff8107e3f7>] ? sub_preempt_count+0x57/0xb0
                      Jul 2 22:36:06 lupus kernel: [ 719.528388] [<ffffffff81719573>] ? _raw_spin_unlock_irqrestore+0x13/0x40
                      Jul 2 22:36:06 lupus kernel: [ 719.528396] [<ffffffff81309011>] ? btrfs_commit_transaction+0x341/0xa00
                      Jul 2 22:36:06 lupus kernel: [ 719.528402] [<ffffffff81073870>] ? finish_wait+0x90/0x90
                      Jul 2 22:36:06 lupus kernel: [ 719.528407] [<ffffffff81309b35>] ? start_transaction+0x95/0x2c0
                      Jul 2 22:36:06 lupus kernel: [ 719.528413] [<ffffffff81302b05>] ? transaction_kthread+0x265/0x2f0
                      Jul 2 22:36:06 lupus kernel: [ 719.528418] [<ffffffff813028a0>] ? btree_lock_page_hook+0x1b0/0x1b0
                      Jul 2 22:36:06 lupus kernel: [ 719.528423] [<ffffffff81073575>] ? kthread+0x85/0x90
                      Jul 2 22:36:06 lupus kernel: [ 719.528430] [<ffffffff8171b594>] ? kernel_thread_helper+0x4/0x10
                      Jul 2 22:36:06 lupus kernel: [ 719.528436] [<ffffffff810734f0>] ? flush_kthread_worker+0x80/0x80
                      Jul 2 22:36:06 lupus kernel: [ 719.528441] [<ffffffff8171b590>] ? gs_change+0xb/0xb
                      Jul 2 22:36:06 lupus kernel: [ 719.528446] INFO: task rsync:10834 blocked for more than 120 seconds.
                      Jul 2 22:36:06 lupus kernel: [ 719.528449] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
                      Jul 2 22:36:06 lupus kernel: [ 719.528452] rsync D ffff88023fc513c0 0 10834 10833 0x00000000
                      Jul 2 22:36:06 lupus kernel: [ 719.528459] ffff8802368b5a00 0000000000000082 0000000000000001 ffff88023536e600
                      Jul 2 22:36:06 lupus kernel: [ 719.528465] ffff880183e63fd8 ffff880183e63fd8 ffff880183e63fd8 ffff88023536e600
                      Jul 2 22:36:06 lupus kernel: [ 719.528472] ffff8801c9a1b620 ffff8801c9a1b400 ffff8801c9a1b620 ffff880183e63ed8
                      Jul 2 22:36:06 lupus kernel: [ 719.528479] Call Trace:
                      Jul 2 22:36:06 lupus kernel: [ 719.528484] [<ffffffff813009d3>] ? btrfs_find_free_ino+0xb3/0x140
                      Jul 2 22:36:06 lupus kernel: [ 719.528489] [<ffffffff81073870>] ? finish_wait+0x90/0x90
                      Jul 2 22:36:06 lupus kernel: [ 719.528495] [<ffffffff8131600d>] ? btrfs_mkdir+0x6d/0x1f0
                      Jul 2 22:36:06 lupus kernel: [ 719.528503] [<ffffffff81134583>] ? vfs_mkdir+0x93/0x140
                      Jul 2 22:36:06 lupus kernel: [ 719.528508] [<ffffffff811387be>] ? sys_mkdirat+0xbe/0xd0
                      Jul 2 22:36:06 lupus kernel: [ 719.528513] [<ffffffff8171a266>] ? system_call_fastpath+0x1a/0x1f
                      Jul 2 22:36:06 lupus kernel: [ 719.528518] INFO: task rsync:10835 blocked for more than 120 seconds.
                      Jul 2 22:36:06 lupus kernel: [ 719.528521] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
                      Jul 2 22:36:06 lupus kernel: [ 719.528524] rsync D ffff88023fc913c0 0 10835 10834 0x00000000
                      Jul 2 22:36:06 lupus kernel: [ 719.528530] ffff880236998000 0000000000000086 0000000000000001 ffff8801bd058000
                      Jul 2 22:36:06 lupus kernel: [ 719.528536] ffff880183ec3fd8 ffff880183ec3fd8 ffff880183ec3fd8 ffff8801bd058000
                      Jul 2 22:36:06 lupus kernel: [ 719.528543] ffff8801f7afc230 ffff88021e9c4128 ffff8801c9a1b520 0000000000000000
                      Jul 2 22:36:06 lupus kernel: [ 719.528549] Call Trace:
                      Jul 2 22:36:06 lupus kernel: [ 719.528555] [<ffffffff81308695>] ? wait_current_trans.isra.30+0xa5/0xf0
                      Jul 2 22:36:06 lupus kernel: [ 719.528560] [<ffffffff81073870>] ? finish_wait+0x90/0x90
                      Jul 2 22:36:06 lupus kernel: [ 719.528565] [<ffffffff81309be5>] ? start_transaction+0x145/0x2c0
                      Jul 2 22:36:06 lupus kernel: [ 719.528570] [<ffffffff8107e389>] ? get_parent_ip+0x9/0x20
                      Jul 2 22:36:06 lupus kernel: [ 719.528576] [<ffffffff81315597>] ? btrfs_rename+0x137/0x6e0
                      Jul 2 22:36:06 lupus kernel: [ 719.528581] [<ffffffff81136735>] ? path_lookupat+0x65/0x6d0
                      Jul 2 22:36:06 lupus kernel: [ 719.528588] [<ffffffff81061445>] ? ns_capable+0x25/0x50
                      Jul 2 22:36:06 lupus kernel: [ 719.528594] [<ffffffff811352d6>] ? vfs_rename+0x126/0x4f0
                      Jul 2 22:36:06 lupus kernel: [ 719.528599] [<ffffffff811401ce>] ? d_lookup+0x2e/0x60
                      Jul 2 22:36:06 lupus kernel: [ 719.528605] [<ffffffff8113313c>] ? __lookup_hash+0x2c/0x120
                      Jul 2 22:36:06 lupus kernel: [ 719.528609] [<ffffffff81138cba>] ? sys_renameat+0x21a/0x240
                      Jul 2 22:36:06 lupus kernel: [ 719.528616] [<ffffffff81143745>] ? notify_change+0x2c5/0x350
                      Jul 2 22:36:06 lupus kernel: [ 719.528621] [<ffffffff8107e389>] ? get_parent_ip+0x9/0x20
                      Jul 2 22:36:06 lupus kernel: [ 719.528626] [<ffffffff8107e3f7>] ? sub_preempt_count+0x57/0xb0
                      Jul 2 22:36:06 lupus kernel: [ 719.528632] [<ffffffff8107e4cf>] ? add_preempt_count+0x7f/0xb0
                      Jul 2 22:36:06 lupus kernel: [ 719.528637] [<ffffffff8107e389>] ? get_parent_ip+0x9/0x20
                      Jul 2 22:36:06 lupus kernel: [ 719.528642] [<ffffffff8107e389>] ? get_parent_ip+0x9/0x20
                      Jul 2 22:36:06 lupus kernel: [ 719.528647] [<ffffffff8107e3f7>] ? sub_preempt_count+0x57/0xb0
                      Jul 2 22:36:06 lupus kernel: [ 719.528653] [<ffffffff81144d34>] ? vfsmount_lock_local_unlock+0x24/0x40
                      Jul 2 22:36:06 lupus kernel: [ 719.528659] [<ffffffff8171a266>] ? system_call_fastpath+0x1a/0x1f
                      Jul 2 22:38:06 lupus kernel: [ 839.321734] INFO: task btrfs-transacti:10787 blocked for more than 120 seconds.
                      Jul 2 22:38:06 lupus kernel: [ 839.321739] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
                      Jul 2 22:38:06 lupus kernel: [ 839.321743] btrfs-transacti D ffff88023fc113c0 0 10787 2 0x00000000
                      Jul 2 22:38:06 lupus kernel: [ 839.321750] ffff880236b10c00 0000000000000046 ffff8801eae324b8 ffff8801bd059e00
                      Jul 2 22:38:06 lupus kernel: [ 839.321758] ffff8801a0a4bfd8 ffff8801a0a4bfd8 ffff8801a0a4bfd8 ffff8801bd059e00
                      Jul 2 22:38:06 lupus kernel: [ 839.321764] 0000000000000001 ffff8801817784b0 ffff8801c9a1fc00 0000000000000000
                      Jul 2 22:38:06 lupus kernel: [ 839.321772] Call Trace:
                      Jul 2 22:38:06 lupus kernel: [ 839.321781] [<ffffffff81716aa2>] ? schedule_timeout+0x192/0x210
                      Jul 2 22:38:06 lupus kernel: [ 839.321789] [<ffffffff8107e389>] ? get_parent_ip+0x9/0x20
                      Jul 2 22:38:06 lupus kernel: [ 839.321795] [<ffffffff8107e389>] ? get_parent_ip+0x9/0x20
                      Jul 2 22:38:06 lupus kernel: [ 839.321800] [<ffffffff8107e389>] ? get_parent_ip+0x9/0x20
                      Jul 2 22:38:06 lupus kernel: [ 839.321805] [<ffffffff8107e3f7>] ? sub_preempt_count+0x57/0xb0
                      Jul 2 22:38:06 lupus kernel: [ 839.321811] [<ffffffff81719573>] ? _raw_spin_unlock_irqrestore+0x13/0x40
                      Jul 2 22:38:06 lupus kernel: [ 839.321819] [<ffffffff81309011>] ? btrfs_commit_transaction+0x341/0xa00
                      Jul 2 22:38:06 lupus kernel: [ 839.321825] [<ffffffff81073870>] ? finish_wait+0x90/0x90
                      Jul 2 22:38:06 lupus kernel: [ 839.321831] [<ffffffff81309b35>] ? start_transaction+0x95/0x2c0
                      Jul 2 22:38:06 lupus kernel: [ 839.321836] [<ffffffff81302b05>] ? transaction_kthread+0x265/0x2f0
                      Jul 2 22:38:06 lupus kernel: [ 839.321841] [<ffffffff813028a0>] ? btree_lock_page_hook+0x1b0/0x1b0
                      Jul 2 22:38:06 lupus kernel: [ 839.321846] [<ffffffff81073575>] ? kthread+0x85/0x90
                      Jul 2 22:38:06 lupus kernel: [ 839.321853] [<ffffffff8171b594>] ? kernel_thread_helper+0x4/0x10
                      Jul 2 22:38:06 lupus kernel: [ 839.321858] [<ffffffff810734f0>] ? flush_kthread_worker+0x80/0x80
                      Jul 2 22:38:06 lupus kernel: [ 839.321864] [<ffffffff8171b590>] ? gs_change+0xb/0xb
                      now I can't use it anymore

                      good thing to know that


                      hopefully ZFS works well enough

                      Comment

                      Working...
                      X