One disk with bad sectors: what steps to take?

Discussion in 'Storage' started by jfr2006, Jun 10, 2011.

Thread Status:
Not open for further replies.
  1. jfr2006 New Member

    Member Since:
    May 27, 2011
    Message Count:
    180
    Likes Received:
    1
    Trophy Points:
    0
    jfr2006, Jun 10, 2011

    Yesterday I found out that one of the Samsung disks has bad sectors (very common with this drive; it's the third one I've caught with bad sectors :mad:):

    Code (text):
    freenas# zpool status
      pool: volume1
     state: ONLINE
    status: One or more devices has experienced an unrecoverable error.  An
            attempt was made to correct the error.  Applications are unaffected.
    action: Determine if the device needs to be replaced, and clear the errors
            using 'zpool clear' or replace the device with 'zpool replace'.
       see: http://www.sun.com/msg/ZFS-8000-9P
     scrub: none requested
    config:

            NAME                                            STATE     READ WRITE CKSUM
            volume1                                         ONLINE       0     0     0
              raidz1                                        ONLINE       0     0     0
                gptid/be4f7e18-91d9-11e0-869c-f46d04923303  ONLINE   56.8K 14.3M     0
                gptid/bea6a51b-91d9-11e0-869c-f46d04923303  ONLINE       0     0     0
                gptid/bf1702f1-91d9-11e0-869c-f46d04923303  ONLINE       0     0     0
                gptid/bf6feb92-91d9-11e0-869c-f46d04923303  ONLINE       0     0     0
                gptid/bfd28ef2-91d9-11e0-869c-f46d04923303  ONLINE       0     0     0

    errors: No known data errors

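In the output above, the only member with non-zero counters is the first gptid device (READ 56.8K, WRITE 14.3M). On a pool with many disks it can help to filter the table automatically. The following is a minimal sketch, not an official FreeNAS tool: the function name `faulty_devices` is made up for illustration, and it assumes the column layout shown in this thread (NAME, STATE, READ, WRITE, CKSUM).

```shell
#!/bin/sh
# List pool members whose READ, WRITE or CKSUM counter is non-zero.
# Reads `zpool status` output on stdin, so it also works on saved output.
# Assumes the five-column config table shown in this thread.
faulty_devices() {
    awk '
        $1 == "NAME" && $2 == "STATE" { in_config = 1; next }  # header row of the config table
        in_config && NF == 0          { in_config = 0 }        # blank line ends the table
        in_config && NF >= 5 && ($3 != "0" || $4 != "0" || $5 != "0") {
            print $1, "read=" $3, "write=" $4, "cksum=" $5
        }
    '
}

# Typical use on the live system:
#   zpool status volume1 | faulty_devices
```

The counters are compared as strings because ZFS abbreviates large values (56.8K, 14.3M), so anything other than a literal 0 is treated as an error count.
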
    So my question is: what are the best steps to take to replace this disk?

    What I want to know is the sequence of commands to run on the command line, or the steps to take in the GUI.

    Regards.
  2. freeflow New Member

    Member Since:
    May 29, 2011
    Message Count:
    38
    Likes Received:
    1
    Trophy Points:
    0
    freeflow, Jun 10, 2011

    Use Samsung ESTOOLS to do a full media scan.

    If ESTOOLS reports errors, then replace the disk.

    If ESTOOLS reports an OK disk, then replace the SATA cable, do a zpool clear, then do a zpool scrub.

    After the scrub, check the status of the RaidZ1 and the SMART data of the affected drive.

    If there are no zpool errors, then it was possibly just a dodgy SATA cable/connection. Check the SMART data and decide whether the disk needs replacing. Copy data to the disk and then review as above.

    If there are zpool errors, then replace the disk. etc etc etc
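The sequence above can be sketched with the standard ZFS and smartmontools commands. This is a hedged outline rather than the definitive FreeNAS procedure: the pool name and gptid come from the zpool status output earlier in the thread, while `NEW_DISK` and `SMART_DEV` are hypothetical placeholders, and the FreeNAS GUI may partition and label a replacement disk differently than a bare `zpool replace`. The helper `run` only prints each command unless `DRY_RUN=0` is set.

```shell
#!/bin/sh
# Sketch of the check-then-replace sequence described above.
# POOL and DISK are taken from the zpool status output in this thread;
# NEW_DISK and SMART_DEV are hypothetical placeholders.
POOL="volume1"
DISK="gptid/be4f7e18-91d9-11e0-869c-f46d04923303"
NEW_DISK="ada5"        # assumption: device node of the replacement disk
SMART_DEV="/dev/ada0"  # assumption: device node of the suspect disk

# Print each command; execute it only when DRY_RUN=0.
run() {
    echo "+ $*"
    if [ "${DRY_RUN:-1}" = "0" ]; then "$@"; fi
}

check_disk() {
    run zpool clear "$POOL" "$DISK"   # reset the error counters
    run zpool scrub "$POOL"           # start re-reading and verifying all data
    run zpool status -v "$POOL"       # repeat until the scrub has finished
    run smartctl -a "$SMART_DEV"      # review the SMART attributes
}

replace_disk() {
    run zpool offline "$POOL" "$DISK"
    # ...swap the physical disk here...
    run zpool replace "$POOL" "$DISK" "$NEW_DISK"
    run zpool status "$POOL"          # watch the resilver progress
}
```

Calling `check_disk` with the default settings just lists the commands, which makes it easy to review them before running anything against a live pool.
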
  3. jfr2006 New Member

    jfr2006, Jun 11, 2011

    Yes, I intend to use ESTOOLS to scan the disk. It's not the first time I've had a Samsung with bad sectors :(

    But I don't think the problem is the SATA cable. They are very good quality and have clips on the ends, so they stay firmly attached :)

    Regards.
  4. jfr2006 New Member

    jfr2006, Jun 13, 2011

    OK, I did some testing with ESTOOLS and no errors were found (complete surface scan). I found out that the power cable had bad contacts, which made the disk turn off and on. I put the disk back in place, replaced the power cable, and after about an hour I get this:


    Should I be worried? The data cable should be OK, since it's the same type for all disks. I can try swapping it, however...
  5. jafin Member

    Member Since:
    May 30, 2011
    Message Count:
    32
    Likes Received:
    0
    Trophy Points:
    6
    Occupation:
    NAS Tinkerer
    Location:
    Australia
    jafin, Jun 13, 2011

    Is the disk still turning on/off? Perhaps your disk platter is OK but the disk drive motor is flaky? That might show up if you run some sort of exhaustive test on the drive that goes for a few hours. (Not sure if Samsung ESTOOLS has this.)
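One way to run a multi-hour test without ESTOOLS is the drive's own extended SMART self-test via smartmontools, and unexpected power cycling can be watched through the Power_Cycle_Count attribute. The snippet below is only a sketch: the function name `power_cycles` is invented for illustration, the device node `/dev/ada0` is an assumption, and it assumes the usual ten-column attribute table printed by `smartctl -A`.

```shell
#!/bin/sh
# Extract the raw Power_Cycle_Count value from `smartctl -A` output on stdin.
# Assumes the standard attribute table, where the raw value is column 10.
power_cycles() {
    awk '$2 == "Power_Cycle_Count" { print $10 }'
}

# Possible use (device node /dev/ada0 is an assumption):
#   smartctl -t long /dev/ada0            # start the extended self-test
#   smartctl -l selftest /dev/ada0        # read the result once it completes
#   smartctl -A /dev/ada0 | power_cycles  # compare before and after a few hours
```

If the count rises while the machine has stayed powered on, the disk is losing power, which points at the power cabling or connector rather than the platters.
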
  6. jfr2006 New Member

    jfr2006, Jun 14, 2011

    Hi:

    Problem solved. The CKSUM errors were left over from the disk's earlier behavior, and the problem was indeed the power cable. SATA power connectors are the worst invention ever made :mad:

    Regards.