[OpenAFS] Re: advice on troubleshooting blocked cache manager on
Wed, 27 Jan 2010 16:42:15 -0500
On Wed, Jan 27, 2010 at 4:30 PM, Adam Megacz <firstname.lastname@example.org> wrote:
> Derrick Brashear <email@example.com> writes:
>>> =A0Lock afs_xvcache status: (none_waiting, write_locked(pid:11013
> Ah, so I am to interpret the thing after the comma as the name of a
> function somewhere within the openafs source code. =A0Knowing that helps =
no, the number is a lock number. find it.
>> assuming you're not running disconnected and actively trying to
>> So then the question is why FlushVCBs is blocking you. well, you said
>> you had multihomed fileservers.
> To be completely precise, one of my fileservers is a machine with two IP
> addresses, with a one-line NetInfo file. =A0By "multihomed" did you mean
> "on a machine with two public IPs" or "the AFS server somehow knows
> about both IPs"?
>> RXAFS_GiveUpCallBacks is called here. you didn't perchance grab
>> rxdebug output for the client at this point?
> Sorry, no; I will do that next time.
>> could we address this? yes! how? well, i suppose we could on network
>> events (macos has support for this) and when a new server is
>> discovered, probe all addresses, so any unreachable addresses are
>> marked down in advance.
> How do I ask the cache manager to tell me what IPs it thinks a
> particular server has?
You don't. You can ask the vlserver, which is how the CM found out anyhow:
vos listaddrs -printuuid -noresolve