sandbox.zend.com - Community Server out of action: due for hardware replacement

classic Classic list List threaded Threaded
6 messages Options
Reply | Threaded
Open this post in threaded view
|

sandbox.zend.com - Community Server out of action: due for hardware replacement

GavinZend
FYI, I was using the community server last night, and the filesystems
locked up, became "read-only" with the server giving all the signs of
having difficulty reading the hard drive.  We are recommending to have
the server replaced due to hardware failure.  We will keep you posted on
the status.

Hard disk is toast according to smartctl:

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  
LifeTime(hours)  LBA_of_first_error
# 1  Short offline       Completed: read failure       90%    
20553         76799128

Cheers,
Gavin

Richard Thomas wrote:

> Its back up and running, for the admins /etc/rc.d/init.d/wildfired
> start if it crashes in the future.
>
> On 2/15/07, flipkick <[hidden email]> wrote:
>>
>> there is a link on the zd dev homepage. sorry, never noticed that..
>> --
>> View this message in context:
>> http://www.nabble.com/Community-chat-tf2338538s16154.html#a8998653
>> Sent from the Zend Framework mailing list archive at Nabble.com.
>>
Reply | Threaded
Open this post in threaded view
|

Re: sandbox.zend.com - Community Server out of action: due for hardware replacement

cyberlot
Don't know if you have a seperate ticket open but this is whats going
on as this time, If they do swap out the drives they will put the old
drive in as a secondary so we can recover the backup files.

Im a little short on time so I don't know when I will be able to
replicate the php setup.


2/16/2007 12:26:45 PM
WebTech
Dear Ian East:

The smartd errors you pasted into the ticket and the history of this
issue indicate either a bad IDE cable or a hard drive that is failing.
I would like to have the IDE cable replaced to see if this will help.
If it does not help, the hard drive will need to be replaced. Please
update this ticket and let us know when we may have the IDE cable
replaced.

Jared R. - RHCE
Technical Support Specialist
http://www.theplanet.com



On 2/16/07, Gavin Vess <[hidden email]> wrote:

> FYI, I was using the community server last night, and the filesystems
> locked up, became "read-only" with the server giving all the signs of
> having difficulty reading the hard drive.  We are recommending to have
> the server replaced due to hardware failure.  We will keep you posted on
> the status.
>
> Hard disk is toast according to smartctl:
>
> SMART Self-test log structure revision number 1
> Num  Test_Description    Status                  Remaining
> LifeTime(hours)  LBA_of_first_error
> # 1  Short offline       Completed: read failure       90%
> 20553         76799128
>
> Cheers,
> Gavin
>
> Richard Thomas wrote:
> > Its back up and running, for the admins /etc/rc.d/init.d/wildfired
> > start if it crashes in the future.
> >
> > On 2/15/07, flipkick <[hidden email]> wrote:
> >>
> >> there is a link on the zd dev homepage. sorry, never noticed that..
> >> --
> >> View this message in context:
> >> http://www.nabble.com/Community-chat-tf2338538s16154.html#a8998653
> >> Sent from the Zend Framework mailing list archive at Nabble.com.
> >>
>
Reply | Threaded
Open this post in threaded view
|

Re: sandbox.zend.com - Community Server out of action: due for hardware replacement

GavinZend
Hi Richard,

No problems .. I know we are all super busy :)

I'm not happy that EV1 is too cheap to just put in a new drive and a new
cable, but nothing I can do about it.

After they install the new cable, we can stress the drive out with a
bunch of hdparm tests and "long" type smartctl tests until the errors
surface again, and then ask for a new drive.  Also, one way to make a
drive crash faster ... just stick it in a loop reading from the sectors
that are already bad.  This basically digs a physical hole into the
damaged platter, kicking up more dust and debris that makes the drive
fail faster.

Cheers,
Gavin

Richard Thomas wrote:

> Don't know if you have a seperate ticket open but this is whats going
> on as this time, If they do swap out the drives they will put the old
> drive in as a secondary so we can recover the backup files.
>
> Im a little short on time so I don't know when I will be able to
> replicate the php setup.
>
>
> 2/16/2007 12:26:45 PM
> WebTech
> Dear Ian East:
>
> The smartd errors you pasted into the ticket and the history of this
> issue indicate either a bad IDE cable or a hard drive that is failing.
> I would like to have the IDE cable replaced to see if this will help.
> If it does not help, the hard drive will need to be replaced. Please
> update this ticket and let us know when we may have the IDE cable
> replaced.
>
> Jared R. - RHCE
> Technical Support Specialist
> http://www.theplanet.com
>
>
>
> On 2/16/07, Gavin Vess <[hidden email]> wrote:
>> FYI, I was using the community server last night, and the filesystems
>> locked up, became "read-only" with the server giving all the signs of
>> having difficulty reading the hard drive.  We are recommending to have
>> the server replaced due to hardware failure.  We will keep you posted on
>> the status.
>>
>> Hard disk is toast according to smartctl:
>>
>> SMART Self-test log structure revision number 1
>> Num  Test_Description    Status                  Remaining
>> LifeTime(hours)  LBA_of_first_error
>> # 1  Short offline       Completed: read failure       90%
>> 20553         76799128
>>
>> Cheers,
>> Gavin
>>
>> Richard Thomas wrote:
>> > Its back up and running, for the admins /etc/rc.d/init.d/wildfired
>> > start if it crashes in the future.
>> >
>> > On 2/15/07, flipkick <[hidden email]> wrote:
>> >>
>> >> there is a link on the zd dev homepage. sorry, never noticed that..
>> >> --
>> >> View this message in context:
>> >> http://www.nabble.com/Community-chat-tf2338538s16154.html#a8998653
>> >> Sent from the Zend Framework mailing list archive at Nabble.com.
>> >>

Reply | Threaded
Open this post in threaded view
|

Re: sandbox.zend.com - Community Server out of action: due for hardware replacement

cyberlot
They are going to install a new drive, and put the old drive in so we
can pull the data off it.

I will see if I can make some time this weekend to get ensim caught up
so we can restore the sites that have been backup. If we restore them
right away there will be files missing due to rpm's I installed which
might result in php not having all the links it needs to run.

On 2/16/07, Gavin Vess <[hidden email]> wrote:

> Hi Richard,
>
> No problems .. I know we are all super busy :)
>
> I'm not happy that EV1 is too cheap to just put in a new drive and a new
> cable, but nothing I can do about it.
>
> After they install the new cable, we can stress the drive out with a
> bunch of hdparm tests and "long" type smartctl tests until the errors
> surface again, and then ask for a new drive.  Also, one way to make a
> drive crash faster ... just stick it in a loop reading from the sectors
> that are already bad.  This basically digs a physical hole into the
> damaged platter, kicking up more dust and debris that makes the drive
> fail faster.
>
> Cheers,
> Gavin
>
> Richard Thomas wrote:
> > Don't know if you have a seperate ticket open but this is whats going
> > on as this time, If they do swap out the drives they will put the old
> > drive in as a secondary so we can recover the backup files.
> >
> > Im a little short on time so I don't know when I will be able to
> > replicate the php setup.
> >
> >
> > 2/16/2007 12:26:45 PM
> > WebTech
> > Dear Ian East:
> >
> > The smartd errors you pasted into the ticket and the history of this
> > issue indicate either a bad IDE cable or a hard drive that is failing.
> > I would like to have the IDE cable replaced to see if this will help.
> > If it does not help, the hard drive will need to be replaced. Please
> > update this ticket and let us know when we may have the IDE cable
> > replaced.
> >
> > Jared R. - RHCE
> > Technical Support Specialist
> > http://www.theplanet.com
> >
> >
> >
> > On 2/16/07, Gavin Vess <[hidden email]> wrote:
> >> FYI, I was using the community server last night, and the filesystems
> >> locked up, became "read-only" with the server giving all the signs of
> >> having difficulty reading the hard drive.  We are recommending to have
> >> the server replaced due to hardware failure.  We will keep you posted on
> >> the status.
> >>
> >> Hard disk is toast according to smartctl:
> >>
> >> SMART Self-test log structure revision number 1
> >> Num  Test_Description    Status                  Remaining
> >> LifeTime(hours)  LBA_of_first_error
> >> # 1  Short offline       Completed: read failure       90%
> >> 20553         76799128
> >>
> >> Cheers,
> >> Gavin
> >>
> >> Richard Thomas wrote:
> >> > Its back up and running, for the admins /etc/rc.d/init.d/wildfired
> >> > start if it crashes in the future.
> >> >
> >> > On 2/15/07, flipkick <[hidden email]> wrote:
> >> >>
> >> >> there is a link on the zd dev homepage. sorry, never noticed that..
> >> >> --
> >> >> View this message in context:
> >> >> http://www.nabble.com/Community-chat-tf2338538s16154.html#a8998653
> >> >> Sent from the Zend Framework mailing list archive at Nabble.com.
> >> >>
>
>
Reply | Threaded
Open this post in threaded view
|

Re: sandbox.zend.com - Community Server out of action: due for hardware replacement

cyberlot
The box is supposed to be reloaded with the old drive in as slave but
its not responding at this time.

On 2/17/07, Richard Thomas <[hidden email]> wrote:

> They are going to install a new drive, and put the old drive in so we
> can pull the data off it.
>
> I will see if I can make some time this weekend to get ensim caught up
> so we can restore the sites that have been backup. If we restore them
> right away there will be files missing due to rpm's I installed which
> might result in php not having all the links it needs to run.
>
> On 2/16/07, Gavin Vess <[hidden email]> wrote:
> > Hi Richard,
> >
> > No problems .. I know we are all super busy :)
> >
> > I'm not happy that EV1 is too cheap to just put in a new drive and a new
> > cable, but nothing I can do about it.
> >
> > After they install the new cable, we can stress the drive out with a
> > bunch of hdparm tests and "long" type smartctl tests until the errors
> > surface again, and then ask for a new drive.  Also, one way to make a
> > drive crash faster ... just stick it in a loop reading from the sectors
> > that are already bad.  This basically digs a physical hole into the
> > damaged platter, kicking up more dust and debris that makes the drive
> > fail faster.
> >
> > Cheers,
> > Gavin
> >
> > Richard Thomas wrote:
> > > Don't know if you have a seperate ticket open but this is whats going
> > > on as this time, If they do swap out the drives they will put the old
> > > drive in as a secondary so we can recover the backup files.
> > >
> > > Im a little short on time so I don't know when I will be able to
> > > replicate the php setup.
> > >
> > >
> > > 2/16/2007 12:26:45 PM
> > > WebTech
> > > Dear Ian East:
> > >
> > > The smartd errors you pasted into the ticket and the history of this
> > > issue indicate either a bad IDE cable or a hard drive that is failing.
> > > I would like to have the IDE cable replaced to see if this will help.
> > > If it does not help, the hard drive will need to be replaced. Please
> > > update this ticket and let us know when we may have the IDE cable
> > > replaced.
> > >
> > > Jared R. - RHCE
> > > Technical Support Specialist
> > > http://www.theplanet.com
> > >
> > >
> > >
> > > On 2/16/07, Gavin Vess <[hidden email]> wrote:
> > >> FYI, I was using the community server last night, and the filesystems
> > >> locked up, became "read-only" with the server giving all the signs of
> > >> having difficulty reading the hard drive.  We are recommending to have
> > >> the server replaced due to hardware failure.  We will keep you posted on
> > >> the status.
> > >>
> > >> Hard disk is toast according to smartctl:
> > >>
> > >> SMART Self-test log structure revision number 1
> > >> Num  Test_Description    Status                  Remaining
> > >> LifeTime(hours)  LBA_of_first_error
> > >> # 1  Short offline       Completed: read failure       90%
> > >> 20553         76799128
> > >>
> > >> Cheers,
> > >> Gavin
> > >>
> > >> Richard Thomas wrote:
> > >> > Its back up and running, for the admins /etc/rc.d/init.d/wildfired
> > >> > start if it crashes in the future.
> > >> >
> > >> > On 2/15/07, flipkick <[hidden email]> wrote:
> > >> >>
> > >> >> there is a link on the zd dev homepage. sorry, never noticed that..
> > >> >> --
> > >> >> View this message in context:
> > >> >> http://www.nabble.com/Community-chat-tf2338538s16154.html#a8998653
> > >> >> Sent from the Zend Framework mailing list archive at Nabble.com.
> > >> >>
> >
> >
>
Reply | Threaded
Open this post in threaded view
|

Re: sandbox.zend.com - Community Server out of action: due for hardware replacement

cyberlot
I copied the whole old drive into /root/OLDDRIVE had a couple errors
so not all the files are good

Am in the process of restoring the backups, Its possible this will
bring the domains online but they will be fragile until I get the base
system back to where it was.

On 2/17/07, Richard Thomas <[hidden email]> wrote:

> The box is supposed to be reloaded with the old drive in as slave but
> its not responding at this time.
>
> On 2/17/07, Richard Thomas <[hidden email]> wrote:
> > They are going to install a new drive, and put the old drive in so we
> > can pull the data off it.
> >
> > I will see if I can make some time this weekend to get ensim caught up
> > so we can restore the sites that have been backup. If we restore them
> > right away there will be files missing due to rpm's I installed which
> > might result in php not having all the links it needs to run.
> >
> > On 2/16/07, Gavin Vess <[hidden email]> wrote:
> > > Hi Richard,
> > >
> > > No problems .. I know we are all super busy :)
> > >
> > > I'm not happy that EV1 is too cheap to just put in a new drive and a new
> > > cable, but nothing I can do about it.
> > >
> > > After they install the new cable, we can stress the drive out with a
> > > bunch of hdparm tests and "long" type smartctl tests until the errors
> > > surface again, and then ask for a new drive.  Also, one way to make a
> > > drive crash faster ... just stick it in a loop reading from the sectors
> > > that are already bad.  This basically digs a physical hole into the
> > > damaged platter, kicking up more dust and debris that makes the drive
> > > fail faster.
> > >
> > > Cheers,
> > > Gavin
> > >
> > > Richard Thomas wrote:
> > > > Don't know if you have a seperate ticket open but this is whats going
> > > > on as this time, If they do swap out the drives they will put the old
> > > > drive in as a secondary so we can recover the backup files.
> > > >
> > > > Im a little short on time so I don't know when I will be able to
> > > > replicate the php setup.
> > > >
> > > >
> > > > 2/16/2007 12:26:45 PM
> > > > WebTech
> > > > Dear Ian East:
> > > >
> > > > The smartd errors you pasted into the ticket and the history of this
> > > > issue indicate either a bad IDE cable or a hard drive that is failing.
> > > > I would like to have the IDE cable replaced to see if this will help.
> > > > If it does not help, the hard drive will need to be replaced. Please
> > > > update this ticket and let us know when we may have the IDE cable
> > > > replaced.
> > > >
> > > > Jared R. - RHCE
> > > > Technical Support Specialist
> > > > http://www.theplanet.com
> > > >
> > > >
> > > >
> > > > On 2/16/07, Gavin Vess <[hidden email]> wrote:
> > > >> FYI, I was using the community server last night, and the filesystems
> > > >> locked up, became "read-only" with the server giving all the signs of
> > > >> having difficulty reading the hard drive.  We are recommending to have
> > > >> the server replaced due to hardware failure.  We will keep you posted on
> > > >> the status.
> > > >>
> > > >> Hard disk is toast according to smartctl:
> > > >>
> > > >> SMART Self-test log structure revision number 1
> > > >> Num  Test_Description    Status                  Remaining
> > > >> LifeTime(hours)  LBA_of_first_error
> > > >> # 1  Short offline       Completed: read failure       90%
> > > >> 20553         76799128
> > > >>
> > > >> Cheers,
> > > >> Gavin
> > > >>
> > > >> Richard Thomas wrote:
> > > >> > Its back up and running, for the admins /etc/rc.d/init.d/wildfired
> > > >> > start if it crashes in the future.
> > > >> >
> > > >> > On 2/15/07, flipkick <[hidden email]> wrote:
> > > >> >>
> > > >> >> there is a link on the zd dev homepage. sorry, never noticed that..
> > > >> >> --
> > > >> >> View this message in context:
> > > >> >> http://www.nabble.com/Community-chat-tf2338538s16154.html#a8998653
> > > >> >> Sent from the Zend Framework mailing list archive at Nabble.com.
> > > >> >>
> > >
> > >
> >
>