[Bro-Dev] [JIRA] (BIT-1306) bro process would get stuck/freeze with myricom drivers

Aashish Sharma (JIRA) jira at bro-tracker.atlassian.net
Thu Jan 22 14:57:00 PST 2015


Aashish Sharma created BIT-1306:
-----------------------------------

             Summary: bro process would get stuck/freeze with myricom drivers
                 Key: BIT-1306
                 URL: https://bro-tracker.atlassian.net/browse/BIT-1306
             Project: Bro Issue Tracker
          Issue Type: Problem
          Components: Bro
    Affects Versions: git/master
         Environment: OS: FreeBSD 9.3-RELEASE-p5

bro version 2.3-328

git log -1 --format="%H"
379593c7fded0f9791ae71a52dd78a4c9d5a2c1f

            Reporter: Aashish Sharma


When I stop bro (in cluster mode), one of the bro worker processes (which one appears to be random) gets stuck: it does not shut down or stop, and it cannot even be killed with kill -s 9.

The system ultimately has to be rebooted to remove the stuck bro process.
When I run myri_start_stop, I see:

# /usr/local/opt/snf/sbin/myri_start_stop stop
Removing myri_snf.ko
kldunload: can't unload file: Device busy

It appears that the myri_snf.ko driver cannot be unloaded because of the stuck bro process: that process still holds an open descriptor on the Sniffer device/driver, and the bro process itself stays frozen.

More details:

The bro process is stuck in the RNE state. Per ps(1), the state flags mean:

R       Marks a runnable process.
N       The process has reduced CPU scheduling priority (see setpriority(2)).
E       The process is trying to exit.
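
For anyone scripting a check for this condition, the flag meanings listed above can be decoded mechanically. The following is only a minimal sketch: the table holds just the three flags quoted from ps(1) in this report, and the names STATE_FLAGS and decode_state are hypothetical helpers, not part of bro or broctl.

```python
# Meanings copied from the ps(1) excerpt quoted in this report.
# Any other flag letter is deliberately left unmapped.
STATE_FLAGS = {
    "R": "runnable",
    "N": "reduced CPU scheduling priority",
    "E": "trying to exit",
}


def decode_state(stat):
    """Return one description per character of a ps STAT string."""
    return [STATE_FLAGS.get(ch, "unknown flag " + repr(ch)) for ch in stat]


if __name__ == "__main__":
    # Decode the state reported for the stuck worker.
    for meaning in decode_state("RNE"):
        print(meaning)
```

The combination of R and E is the notable part: the process is marked as exiting yet is still runnable and consuming CPU.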

Here is an example:

### stuck process:

[bro at 01 ~]$ ps auxwww | fgrep 1616
bro    1616  100.0  0.0 758040 60480 ??  RNE   2:57PM   53:50.04 /usr/local/bro-git/bin/bro -i myri0 -U .status -p broctl -p broctl-live -p local -p worker-1-1 mgr.bro broctl base/frameworks/cluster local-worker.bro broctl/auto

### when checking for the process in /proc:

[bro at c ~]$ ls -l /proc/1616
ls: /proc/1616: No such file or directory
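
A rough sketch of how the ps output in the example above could be checked automatically for workers in this state. It assumes the FreeBSD `ps aux` column order (USER PID %CPU %MEM VSZ RSS TT STAT STARTED TIME COMMAND) and that the STARTED field contains no spaces. The names parse_ps_line and is_stuck_exiting are hypothetical, the sample line is shortened from the one above, and the "R together with E" test is only a heuristic drawn from this single report.

```python
# Column order assumed from FreeBSD `ps aux` output.
COLUMNS = ["user", "pid", "cpu", "mem", "vsz", "rss",
           "tt", "stat", "started", "time", "command"]


def parse_ps_line(line):
    """Split one `ps aux` line into a dict keyed by column name."""
    # Limit the split so the command keeps its internal spaces.
    fields = line.split(None, len(COLUMNS) - 1)
    if len(fields) != len(COLUMNS):
        raise ValueError("unexpected ps line: " + repr(line))
    return dict(zip(COLUMNS, fields))


def is_stuck_exiting(proc):
    """Heuristic: the process is marked exiting (E) but still runnable (R)."""
    return "R" in proc["stat"] and "E" in proc["stat"]


# Shortened copy of the ps line from this report.
SAMPLE = ("bro    1616  100.0  0.0 758040 60480 ??  RNE   2:57PM   53:50.04 "
          "/usr/local/bro-git/bin/bro -i myri0 -U .status -p broctl")

if __name__ == "__main__":
    proc = parse_ps_line(SAMPLE)
    print(proc["pid"], proc["stat"], is_stuck_exiting(proc))
```

A process that merely passes through the E state briefly during a normal exit would also match, so a real check would want to sample more than once before flagging it.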





