[OpenAFS] Problems Setting up new AFS cell in AWS

Victor Marmol vmarmol@gmail.com
Mon, 17 Jun 2013 20:40:01 -0700


Hi,

I am setting up a new AFS cell on AWS and am running into some issues
during afs-newcell. I am setting up the first AFS server with all the
components. When I first set up the cell, everything worked fine except that
it was inaccessible from outside AWS. It seemed that the fileserver and
dbserver were being resolved to the instance's private hostname and IP
inside AWS's NAT. On my second try I edited afs-newcell to use my desired
hostname instead of the instance's. This hostname has a public IP and is
accessible from outside AWS.

Now ptserver fails to start, apparently because the local hostname and the
hostname I configured do not agree. The PtLog says:

Tue Jun 18 03:31:55 2013 Inconsistent Cell Info from server: Tue Jun 18 03:31:55 2013 <instance's private NAT IP> Tue Jun 18 03:31:55 2013
Tue Jun 18 03:31:55 2013 Local CellServDB:Tue Jun 18 03:31:55 2013 Server 1: Tue Jun 18 03:31:55 2013 <instance's public IP> Tue Jun 18 03:31:55 2013
Tue Jun 18 03:31:55 2013 Inconsistent Cell Info on server: Tue Jun 18 03:31:55 2013 <instance's public IP> Tue Jun 18 03:31:55 2013
ptserver: problems with host name Ubik init failed
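
For reference, the mismatch the log describes can be reproduced outside of
AFS. The following is only a rough sketch of that comparison, not what
ptserver or Ubik actually runs: it resolves the machine's own hostname and
the hostname placed in CellServDB, then checks whether they share any IPv4
address. The function names and the example hostname are made up for
illustration.

```python
import socket


def resolve_ipv4(hostname):
    """Return the sorted IPv4 addresses a hostname resolves to ([] on failure)."""
    try:
        infos = socket.getaddrinfo(hostname, None, socket.AF_INET, socket.SOCK_STREAM)
    except socket.gaierror:
        return []
    # Each entry is (family, type, proto, canonname, sockaddr); sockaddr[0] is the IP.
    return sorted({info[4][0] for info in infos})


def addresses_agree(local_addrs, configured_addrs):
    """True if at least one configured address is also a local address."""
    return bool(set(local_addrs) & set(configured_addrs))


def check(configured_hostname):
    """Compare this machine's hostname against the hostname used for the cell."""
    local_addrs = resolve_ipv4(socket.gethostname())
    configured_addrs = resolve_ipv4(configured_hostname)
    print("local hostname resolves to:     ", local_addrs)
    print("configured hostname resolves to:", configured_addrs)
    return addresses_agree(local_addrs, configured_addrs)
```

Calling check("afs.example.com") (a placeholder name) on the instance would
be expected to return False in a setup like mine, where the local hostname
maps to the private NAT address and the configured hostname maps to the
public one.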

afs-newcell fails when it tries to contact bos. The BosLog says:

Tue Jun 18 03:31:54 2013: Core limits now -1 -1
Tue Jun 18 03:31:54 2013: Server directory access is okay
Tue Jun 18 03:31:54 2013: Listening on 0.0.0.0:7007
Tue Jun 18 03:31:54 2013: vlserver exited with code 2
...
Tue Jun 18 03:31:54 2013: ptserver exited with code 2
Tue Jun 18 03:31:54 2013: BNODE 'ptserver' repeatedly failed to start, perhaps missing executable.
Tue Jun 18 03:31:54 2013: vlserver exited with code 2
Tue Jun 18 03:31:54 2013: BNODE 'vlserver' repeatedly failed to start, perhaps missing executable.
Tue Jun 18 03:31:55 2013: ptserver exited with code 2
Tue Jun 18 03:31:55 2013: BNODE 'ptserver' repeatedly failed to start, perhaps missing executable.
Tue Jun 18 03:31:55 2013: vlserver exited with code 2
Tue Jun 18 03:31:55 2013: BNODE 'vlserver' repeatedly failed to start, perhaps missing executable.
Tue Jun 18 03:35:20 2013: dafs:salsrv exited with code 1

The little I found online referred to a NetInfo file, but I could not find
any good documentation for it. The file does not exist on my instance.

Has anyone had similar experiences setting up an AFS cell in AWS, or behind
a NAT?

Any help would be greatly appreciated!

Thank you!
Victor
