[OpenAFS] Terminal server/Citrix & AFS Client

Jeffrey Altman jaltman@secure-endpoints.com
Wed, 03 Sep 2008 20:42:35 -0400


This is a cryptographically signed message in MIME format.

--------------ms040305030605060801040901
Content-Type: text/plain; charset=ISO-8859-1
Content-Transfer-Encoding: 7bit

Robin Manke-Cassidy wrote:
> We are running a Citrix farm using AFS as the main storage for the
> users.  We have been experiencing some stability issues with the
> client.  I am looking for anyone that is using terminal server or Citrix
> in a high volume situation.  Here are the symptoms that we are experiencing: 
> 
> 1.  No one can get to the S drive, even after a log out and relaunch of
> an app.  In this case, we've found that the Service is running, but is
> completely non-responsive.  AFS space is at this point, completely
> unusable, and a service restart works only 10-20% of the time.

That indicates either a deadlock or the smb client has dropped the
connection to the "AFS" file service.

If it is a deadlock it is bug that needs to be fixed.  You can obtain a
minidump of the process with "fs minidump" and have it examined offline
to determine if it is in fact a deadlock.

If it is not a deadlock, it could be that too many previous SMB requests
took longer than the SMB client's 45 second timeout period.  In which
case it backs off to reduce load on the SMB server.

What errors or warnings are you seeing in the Windows Application Event
Log?

I have in the last couple of weeks implemented deadlock detection code
within afsd_service.exe.  I sent a link to Jack Hsu earlier today a link
to a private build that implements many fixes based upon the potential
deadlocks that the lock order validation code identified.

> 2.  The client seems to stop responding while folks are on the server. 
> Those that attached before the failure seem to be ok.  

Normally I would say this sounds odd but ...

> Anyone new coming on a server is without an S: drive.  

Are attempts to communicate with the "AFS" file service failing with
an authentication failure?  Perhaps "wrong password" or something else?

I ask because I fixed a bug last week that was leaking memory if the
SMB client was attempting to authenticate and failed.  The leaked memory
was allocated by the LSA so it could have resulted in the LSA running
out of memory.  If so, there would be errors logged to the afsd log
files if "fs trace" was actived.  This fix is also in the private
build I pointed Jack Hsu at.

> 75-90% of the time, a service restart fixes the issue.

But not the rest of the time.  What are the error conditions?

What does the afsd_init.log file report?

Is the "AFS" netbios name being registered in the failure case?

> 3.  One or two people on a server are unable to get to their S: drive. 
> All existing with an S: drive and most to all new sessions get the S:
> drive.  This appears to be caused by the client not initializing
> correctly within the session startup, and is resolved nearly 100% of the
> time with a logout and relaunch of the app.

What is "client" in this context?

> The S: drive is the AFS mounted volume.

When you execute "NET USE", what is S: mapped to?

Is it the freelance root.afs volume?

The cell's root.afs volume?

A per-user home directory?

What does "fs examine \\afs\<cell>#<volume>\" report as the status
for the volume in question?

---

More general questions:

What version of OpenAFS are you using?

How have you tuned the client?  The default values are not appropriate
for a multi-user system?

Is this 32-bit or 64-bit Citrix?  64-bit is strongly recommended as the
maximum cache size on 32-bit systems is ~1GB.

Are there communication problems between the Citrix machines and the
AFS file servers?   We have been seeing problems recently with Rx jumbo
grams and networks that are less than friendly to fragmented UDP packets.

Jeffrey Altman
Secure Endpoints Inc.



--------------ms040305030605060801040901
Content-Type: application/x-pkcs7-signature; name="smime.p7s"
Content-Transfer-Encoding: base64
Content-Disposition: attachment; filename="smime.p7s"
Content-Description: S/MIME Cryptographic Signature

MIAGCSqGSIb3DQEHAqCAMIACAQExCzAJBgUrDgMCGgUAMIAGCSqGSIb3DQEHAQAAoIIJeTCC
AxcwggKAoAMCAQICEDsE+kRcmomW1hYG6BoqhGEwDQYJKoZIhvcNAQEFBQAwYjELMAkGA1UE
BhMCWkExJTAjBgNVBAoTHFRoYXd0ZSBDb25zdWx0aW5nIChQdHkpIEx0ZC4xLDAqBgNVBAMT
I1RoYXd0ZSBQZXJzb25hbCBGcmVlbWFpbCBJc3N1aW5nIENBMB4XDTA4MDUzMDE5MTUyOVoX
DTA5MDUzMDE5MTUyOVowczEPMA0GA1UEBBMGQWx0bWFuMRUwEwYDVQQqEwxKZWZmcmV5IEVy
aWMxHDAaBgNVBAMTE0plZmZyZXkgRXJpYyBBbHRtYW4xKzApBgkqhkiG9w0BCQEWHGphbHRt
YW5Ac2VjdXJlLWVuZHBvaW50cy5jb20wggEiMA0GCSqGSIb3DQEBAQUAA4IBDwAwggEKAoIB
AQCtf5bVJdYFtHIrV2XALpA5oaMu7FPYU7RP7vJhd8Cu9Kd9ud2crX2pHK4avuPaYb4Vg9qI
zPrPadePhJ3OWwNt1ZlUlpc5URnOfpg/I9iymZBUSnCFVLuIvoncacqyUlzqdYEF8XGEoEL6
6bj8uoCSX0D7ZjZiAS8993NvgiPYpf10acMyWQ4max+P7Wg9T03Nw2F6EsmP6gWxBRsekTXe
N6QjJdvaK0846lDqeBFoCEzIUMQXj2kiXVPCPEdxPc/L1sDMYf0GLaDIg8qyThpGd0X6DwfK
3RWcMy8DV7Q5Z+jSEdPn5X0l4anOTrjr3IwE57MC3bVs0EEpUODTzftnAgMBAAGjOTA3MCcG
A1UdEQQgMB6BHGphbHRtYW5Ac2VjdXJlLWVuZHBvaW50cy5jb20wDAYDVR0TAQH/BAIwADAN
BgkqhkiG9w0BAQUFAAOBgQA9kndmeLrdQOUbhNGGms/FnfDyraH4OjA4PIIMOCbGWK0YXczs
/Fqn4XkT70SG4s8v4Zg6TaAcJrZBVcZQXyzrhlF2Zev/g69zZMHQe+2r4i/3FBVKAtFCoea1
vgwJ5TfZYlKvt4D0Z4zexu9Y0VwCIR4plWjVD76zC2CGB/2fhjCCAxcwggKAoAMCAQICEDsE
+kRcmomW1hYG6BoqhGEwDQYJKoZIhvcNAQEFBQAwYjELMAkGA1UEBhMCWkExJTAjBgNVBAoT
HFRoYXd0ZSBDb25zdWx0aW5nIChQdHkpIEx0ZC4xLDAqBgNVBAMTI1RoYXd0ZSBQZXJzb25h
bCBGcmVlbWFpbCBJc3N1aW5nIENBMB4XDTA4MDUzMDE5MTUyOVoXDTA5MDUzMDE5MTUyOVow
czEPMA0GA1UEBBMGQWx0bWFuMRUwEwYDVQQqEwxKZWZmcmV5IEVyaWMxHDAaBgNVBAMTE0pl
ZmZyZXkgRXJpYyBBbHRtYW4xKzApBgkqhkiG9w0BCQEWHGphbHRtYW5Ac2VjdXJlLWVuZHBv
aW50cy5jb20wggEiMA0GCSqGSIb3DQEBAQUAA4IBDwAwggEKAoIBAQCtf5bVJdYFtHIrV2XA
LpA5oaMu7FPYU7RP7vJhd8Cu9Kd9ud2crX2pHK4avuPaYb4Vg9qIzPrPadePhJ3OWwNt1ZlU
lpc5URnOfpg/I9iymZBUSnCFVLuIvoncacqyUlzqdYEF8XGEoEL66bj8uoCSX0D7ZjZiAS89
93NvgiPYpf10acMyWQ4max+P7Wg9T03Nw2F6EsmP6gWxBRsekTXeN6QjJdvaK0846lDqeBFo
CEzIUMQXj2kiXVPCPEdxPc/L1sDMYf0GLaDIg8qyThpGd0X6DwfK3RWcMy8DV7Q5Z+jSEdPn
5X0l4anOTrjr3IwE57MC3bVs0EEpUODTzftnAgMBAAGjOTA3MCcGA1UdEQQgMB6BHGphbHRt
YW5Ac2VjdXJlLWVuZHBvaW50cy5jb20wDAYDVR0TAQH/BAIwADANBgkqhkiG9w0BAQUFAAOB
gQA9kndmeLrdQOUbhNGGms/FnfDyraH4OjA4PIIMOCbGWK0YXczs/Fqn4XkT70SG4s8v4Zg6
TaAcJrZBVcZQXyzrhlF2Zev/g69zZMHQe+2r4i/3FBVKAtFCoea1vgwJ5TfZYlKvt4D0Z4ze
xu9Y0VwCIR4plWjVD76zC2CGB/2fhjCCAz8wggKooAMCAQICAQ0wDQYJKoZIhvcNAQEFBQAw
gdExCzAJBgNVBAYTAlpBMRUwEwYDVQQIEwxXZXN0ZXJuIENhcGUxEjAQBgNVBAcTCUNhcGUg
VG93bjEaMBgGA1UEChMRVGhhd3RlIENvbnN1bHRpbmcxKDAmBgNVBAsTH0NlcnRpZmljYXRp
b24gU2VydmljZXMgRGl2aXNpb24xJDAiBgNVBAMTG1RoYXd0ZSBQZXJzb25hbCBGcmVlbWFp
bCBDQTErMCkGCSqGSIb3DQEJARYccGVyc29uYWwtZnJlZW1haWxAdGhhd3RlLmNvbTAeFw0w
MzA3MTcwMDAwMDBaFw0xMzA3MTYyMzU5NTlaMGIxCzAJBgNVBAYTAlpBMSUwIwYDVQQKExxU
aGF3dGUgQ29uc3VsdGluZyAoUHR5KSBMdGQuMSwwKgYDVQQDEyNUaGF3dGUgUGVyc29uYWwg
RnJlZW1haWwgSXNzdWluZyBDQTCBnzANBgkqhkiG9w0BAQEFAAOBjQAwgYkCgYEAxKY8VXNV
+065yplaHmjAdQRwnd/p/6Me7L3N9VvyGna9fww6YfK/Uc4B1OVQCjDXAmNaLIkVcI7dyfAr
hVqqP3FWy688Cwfn8R+RNiQqE88r1fOCdz0Dviv+uxg+B79AgAJk16emu59l0cUqVIUPSAR/
p7bRPGEEQB5kGXJgt/sCAwEAAaOBlDCBkTASBgNVHRMBAf8ECDAGAQH/AgEAMEMGA1UdHwQ8
MDowOKA2oDSGMmh0dHA6Ly9jcmwudGhhd3RlLmNvbS9UaGF3dGVQZXJzb25hbEZyZWVtYWls
Q0EuY3JsMAsGA1UdDwQEAwIBBjApBgNVHREEIjAgpB4wHDEaMBgGA1UEAxMRUHJpdmF0ZUxh
YmVsMi0xMzgwDQYJKoZIhvcNAQEFBQADgYEASIzRUIPqCy7MDaNmrGcPf6+svsIXoUOWlJ1/
TCG4+DYfqi2fNi/A9BxQIJNwPP2t4WFiw9k6GX6EsZkbAMUaC4J0niVQlGLH2ydxVyWN3amc
OY6MIE9lX5Xa9/eH1sYITq726jTlEBpbNU1341YheILcIRk13iSx0x1G/11fZU8xggNkMIID
YAIBATB2MGIxCzAJBgNVBAYTAlpBMSUwIwYDVQQKExxUaGF3dGUgQ29uc3VsdGluZyAoUHR5
KSBMdGQuMSwwKgYDVQQDEyNUaGF3dGUgUGVyc29uYWwgRnJlZW1haWwgSXNzdWluZyBDQQIQ
OwT6RFyaiZbWFgboGiqEYTAJBgUrDgMCGgUAoIIBwzAYBgkqhkiG9w0BCQMxCwYJKoZIhvcN
AQcBMBwGCSqGSIb3DQEJBTEPFw0wODA5MDQwMDQyMzVaMCMGCSqGSIb3DQEJBDEWBBRYY6MS
dzY8jaqiK28QkrIJHA9zzjBSBgkqhkiG9w0BCQ8xRTBDMAoGCCqGSIb3DQMHMA4GCCqGSIb3
DQMCAgIAgDANBggqhkiG9w0DAgIBQDAHBgUrDgMCBzANBggqhkiG9w0DAgIBKDCBhQYJKwYB
BAGCNxAEMXgwdjBiMQswCQYDVQQGEwJaQTElMCMGA1UEChMcVGhhd3RlIENvbnN1bHRpbmcg
KFB0eSkgTHRkLjEsMCoGA1UEAxMjVGhhd3RlIFBlcnNvbmFsIEZyZWVtYWlsIElzc3Vpbmcg
Q0ECEDsE+kRcmomW1hYG6BoqhGEwgYcGCyqGSIb3DQEJEAILMXigdjBiMQswCQYDVQQGEwJa
QTElMCMGA1UEChMcVGhhd3RlIENvbnN1bHRpbmcgKFB0eSkgTHRkLjEsMCoGA1UEAxMjVGhh
d3RlIFBlcnNvbmFsIEZyZWVtYWlsIElzc3VpbmcgQ0ECEDsE+kRcmomW1hYG6BoqhGEwDQYJ
KoZIhvcNAQEBBQAEggEAbRxdIrQqJESt/SbIWVfwD3xJbv4CQpKn1aA4SLUlPcXykyRnaLFX
Tnyi8u64G+0o4f4BhIJl30TPi2jKtCSTS8abLNbW40SUT5ogi10BicgjyhUWFh8YAS7424u9
FHFhriDzPjaVwrc8AxivEIM7OT0QHKV6hra3hYQKucYKA0kIHOUHxq+Y99xMqHt993IJbn5q
CaBVDYRnXPxHLLt3n9fZ9KhsESgcTiX8kzum4rWB9a7sBTv4/desIPeu58zW/kwsJpLtRrCD
cFvJ1/2CP8BJbRrkldWOB/jL7iRN+NPZ8cY0uVJJmFJAEue1wWU7qAniRf5JRXKYFBfYqptv
TAAAAAAAAA==
--------------ms040305030605060801040901--