PS3 (Research/Experimental) - NEC/TOKIN Capacitors Replacement - YLOD

Here's the only photo I have of the delid as of now. I had to reduce the quality of the pic to upload it here so hopefully it doesn't look too bad.

EDIT: Pressure test not successful. Board looks very clean. Please somebody give me some hope on this thing. If better pictures are needed then let me know
 

Attachments

  • IMG_3817.jpg
    IMG_3817.jpg
    1 MB · Views: 171
Last edited:
So following my posts above I ended up replacing both B side Nec/Tokins (RSX) - each with 3x 2R5TPE470M9 470 2.5V 470UF from Aliexpress.
It booted first time!! (Which didn't happen since the YLODs started, I would always have to remove the hdd, turn it on after 10+ tries and let the unit warmup or heat the tokins directly.)

Going to leave some more info on my unit, so people don't read this and assume Tantalums will fix their unit too.
I never had any graphical issues, GPU artifacts or even overheating, just YLODs on start, sporadic ones after some time of gameplay, and some games like Dragon Ball: Raging Blast 2, Dead Or Alive 5: Last Round, would instantly YLOD the unit.
Also this is a 300 days registered uptime unit that still had the warranty seal, so again, a good candidate faulty/end of life Nec/Tokins.
It's been working fine with no ylods for a week with heavy usage. Let's hope...I'll try no to forget to report back in a year or so.
 
Hi, guys, I also started acting before bumping (pun intended :D) into your posts and findings, @RIP-Felix. :cold: Although these posts of yours might not have been there yet. Or they might have just come out, I don't know.

So, I bought a CECHC04 about three years or so ago and it actually booted and I tried playing one of the Gran Turismo games for about half an hour. No problems. But because I had read about these NEC/Tokin supposedly being bad and in need of replacement, working as an EE I thought replacing them and saving the board for the years to come would be a fun project. However, I suppose I didn't read carefully enough and started with one of the CPU (Cell) caps on top of the PCB. I desoldered it quite nicely and then just put the board in a drawer.
Few years forward to last week: I soldered four of those inappropriate yellow tantalums with too high of an ESR on and thought I'd give it a try. Then I reassembled everything and nothing happened. Not even a single LED would light up. So, after some thought and some googling I concluded the PSU caps dried up and it's failing to provide enough power. I therefore tried hooking my lab supply at 12V to power pins (with correct polarity). Is it possible I killed the board this way? Well, it turned out the goddamn power card flat cable was the problem ... After that I got a YLOD.

Now, here are the running time:
becount

Bringup : 711 times
Shutdown: 636 times
Power-on: 40day 06hour 43min 08sec

the error log (a bunch of 1001s from before and then a bunch of 3001 from now, I think):
errlog

ofst[124]:err_code:0xffffffff, clock:0x14d4a76b 2011/01/27 22:00:11
ofst[ 0]:err_code:0xa0801001, clock:0x16db8561 2012/02/25 11:41:21
ofst[ 4]:err_code:0xa0801001, clock:0x180efecc 2012/10/15 17:05:16
ofst[ 8]:err_code:0xa0801001, clock:0x181042da 2012/10/16 16:07:54
ofst[ 12]:err_code:0xa0801001, clock:0x1857a929 2012/12/09 19:55:21
ofst[ 16]:err_code:0xa0801001, clock:0x1857a982 2012/12/09 19:56:50
ofst[ 20]:err_code:0xa0801001, clock:0x188051c4 2013/01/09 16:05:24
ofst[ 24]:err_code:0xa0901001, clock:0x1880523a 2013/01/09 16:07:22
ofst[ 28]:err_code:0xa0801001, clock:0x192fdfde 2013/05/22 19:58:22
ofst[ 32]:err_code:0xa0801001, clock:0x198be46c 2013/07/31 15:06:20
ofst[ 36]:err_code:0xa0801001, clock:0x199e9aaf 2013/08/14 19:44:47
ofst[ 40]:err_code:0xa0801001, clock:0x1a9f5303 2014/02/25 13:11:31
ofst[ 44]:err_code:0xa0003001, clock:0xffffffff
ofst[ 48]:err_code:0xa0003001, clock:0xffffffff
ofst[ 52]:err_code:0xa0003001, clock:0xffffffff
ofst[ 56]:err_code:0xa0003001, clock:0xffffffff
ofst[ 60]:err_code:0xa0003001, clock:0xffffffff
ofst[ 64]:err_code:0xa0003001, clock:0xffffffff
ofst[ 68]:err_code:0xa0003001, clock:0xffffffff
ofst[ 72]:err_code:0xa0003001, clock:0xffffffff
ofst[ 76]:err_code:0xa0003001, clock:0xffffffff
ofst[ 80]:err_code:0xa0003001, clock:0xffffffff
ofst[ 84]:err_code:0xa0003001, clock:0xffffffff
ofst[ 88]:err_code:0xa0003001, clock:0xffffffff
ofst[ 92]:err_code:0xa0003001, clock:0xffffffff
ofst[ 96]:err_code:0xa0003001, clock:0xffffffff
ofst[100]:err_code:0xa0003001, clock:0xffffffff
ofst[104]:err_code:0xa0003001, clock:0xffffffff
ofst[108]:err_code:0xa0003001, clock:0xffffffff
ofst[112]:err_code:0xa0003001, clock:0xffffffff
ofst[116]:err_code:0xa0003001, clock:0xffffffff
ofst[120]:err_code:0xa0003001, clock:0xffffffff

and the bringup output
bringup

[SSM] state: 0000 -> 0101
Bringup Mode #0 (0xFF)
[SSM] ssmCb_OnStartingBePowOn() called.
[SSM] First Boot.
[SSM] Bringup mode : syspm_stat=00000000/00000000
[POWSEQ] PowerSeq_Setup called.
[SSM] state: 0101 -> 0301
[SSM] PowSeq Fail : Detected !

[SSM] state: 0301 -> 0700
[POWSEQ] AV Backend Letup
[SSM] Shutdown mode : syspm_stat=00000000/00000000
[ERROR]: 0xa0003001
[ERROR]: 0xa0003001

So, what do you experts think? I have to check and measure the power rail at the replaced cap and then absolutely remove these four (and replace them later on). I just didn't have time yet.
Oh, can I replace my caps with a 1000uF and a 330uF TaPolys? I think the 1000uF one should be fine given its ESR/impedance characteristics (sorry, can't post a link yet, it's Kemet's) – I think they are very decent with an ESR of 5mOhm at critical freqs. The 330uF one is a bit worse at higher frequencies impedance-wise. I'd also throw in a couple of ceramic ones (100nF, 10uF, 20uF).

I just hope 3001 doesn't mean the board is dead ...
 
3001 is usually an issue with the 12v rail. Either a faulty PSU or a short on the board on that rail.
If your PSU is ok then you'll need to track the short. It is quite common on COK/SEM & DIA revision boards to have one or more faulty MLCC's on either the CELL or RSX VDDC input rails.
Check with your MM if the 12v rail is short.
If it is,I usually lift one side of the inductors on both rails to separate them from the main 12v rail and see which side the short is.
 
Last edited by a moderator:
Hmm, I suppose the PSU really is dying. The 5V voltage drops really quickly when I load it. At 0.5 A (the APS-227 is rated to 3A at 5V) it's already below 2.5V. That can't work ...
 
Sorry, I can't edit my previous post.

So, I was able to find what the problem was in my case. The 200k pull-up R6018 was missing. I replaced it and now the system boots up. Later, when I was thinking how come it's missing, I figured I must have tore it off when I was sanding the GND edge around the board with sanding paper – it was somehow corroded/oxidized. I don't know what the previous owner was doing with it ...

Now I just have to replace those high ESR caps (fortunately I only removed one tokin – it was actually an RSX one C6141, not a cell one as I said before) with some decent ones and replace the thermal paste again. Then I suppose I'll have a working PS3 again :victorious: Don't know for how long, though ... There are no signs of bad RSX bumps/broken BGA balls yet. Before my fuckup there were only 0xa0801001 errors in the errlog. Someone did delid both big ICs before, however, so I don't know :distrust:
 
Hi all, I have a CECK with 423 days of uptime showing signs of NEC/TOKINs going bad. Randomly shuts down, throws 1002 codes, so I think I'm going to start by replacing one of the RSX tokins.

Now, I have 330uf caps from when I replaced a NEC/TOKIN on a 2001A slim. Can I use these for my K model? Or, do I have to buy 470uf caps?
 
Hey guys. I assume from the lack of replies that I can't use the 330uf caps on my fat model.

I've been looking at these: https://www.ebay.com/itm/4049462778...uid=nfRxWPQ1SiS&widget_ver=artemis&media=COPY

If I use those, should I still keep one original NEC/TOKIN to bridge the positive connections? Or, does this thing take care of that? I can't really tell.

Also, sorry if I'm asking questions that have been answered here before. I've only done the NEC/TOKINs once before
 
Hello everyone, a few days ago I received a ps3 (cechk04, DIA-002 board) from a friend that when gets turned on gets a YLOD after a few seconds and turns off.

I found this forum and decided to use syscon to diagnose before I started to manipulate the board without knowing exactly what was happening. The problem is that the error I get when I run bringup is a bunch of a0202121, which I have seen is not as documented as the rest.

I share my complete log hoping that someone can help me, or at least shed some light on the issue.

Code:
becount

Bringup : 7785 times

Shutdown: 6426 times

Power-on: 342day 17hour 51min 49sec

[mullion]$
bringup

[SSM] state: 0000 -> 0101

Bringup Mode #0 (0xFF)

[SSM] ssmCb_OnStartingBePowOn() called.

[SSM] First Boot.

[SSM] Bringup mode : syspm_stat=00000000/00000000

[POWSEQ] PowerSeq_Setup called.

[SSM] state: 0101 -> 0201

[POWSEQ] AV Backend Setup

[SSM] state: 0201 -> 0102

[ERROR]: 0xa0202121

[ERROR]: 0xa0202121

[SSM] state: 0102 -> 0202

[SSM] state: 0202 -> 0103

[ERROR]: 0xa0202121

[ERROR]: 0xa0202121

[SSM] state: 0103 -> 0203

[SSM] ssmCb_BeforeBeOn() called.

[SSM] state: 0203 -> 0104

[ERROR]: 0xa0202121

Psbd_SbTransMode_Half:0x20e7

[ERROR]: 0xa0202121

[ERROR]: 0xa0202121

[ERROR]: 0xa0202121
errlog

ofst[ 64]:err_code:0xffffffff, clock:0x2de8fc0d  2024/05/28 20:33:17

ofst[ 68]:err_code:0xa0202121, clock:0x2de8fc0d  2024/05/28 20:33:17

ofst[ 72]:err_code:0xa0202121, clock:0x2de8fc0d  2024/05/28 20:33:17

ofst[ 76]:err_code:0xa0202121, clock:0x2de8fc0d  2024/05/28 20:33:17

ofst[ 80]:err_code:0xa0202121, clock:0x2de8fc0d  2024/05/28 20:33:17

ofst[ 84]:err_code:0xa0802021, clock:0x2de8fc12  2024/05/28 20:33:22

ofst[ 88]:err_code:0xa0801002, clock:0x2de8fc13  2024/05/28 20:33:23

ofst[ 92]:err_code:0xa0202121, clock:0x2de8fc9f  2024/05/28 20:35:43

ofst[ 96]:err_code:0xa0202121, clock:0x2de8fc9f  2024/05/28 20:35:43

ofst[100]:err_code:0xa0202121, clock:0x2de8fc9f  2024/05/28 20:35:43

ofst[104]:err_code:0xa0202121, clock:0x2de8fc9f  2024/05/28 20:35:43

ofst[108]:err_code:0xa0202121, clock:0x2de8fc9f  2024/05/28 20:35:43

ofst[112]:err_code:0xa0202121, clock:0x2de8fc9f  2024/05/28 20:35:43

ofst[116]:err_code:0xa0202121, clock:0x2de8fc9f  2024/05/28 20:35:43

ofst[120]:err_code:0xa0202121, clock:0x2de8fc9f  2024/05/28 20:35:43

ofst[124]:err_code:0xa0202121, clock:0x2de8fc9f  2024/05/28 20:35:43

ofst[  0]:err_code:0xa0202121, clock:0x2de8fc9f  2024/05/28 20:35:43

ofst[  4]:err_code:0xa0802021, clock:0x2de8fca3  2024/05/28 20:35:47

ofst[  8]:err_code:0xa0801002, clock:0x2de8fca4  2024/05/28 20:35:48

ofst[ 12]:err_code:0xa0202121, clock:0x2de90d59  2024/05/28 21:47:05

ofst[ 16]:err_code:0xa0202121, clock:0x2de90d59  2024/05/28 21:47:05

ofst[ 20]:err_code:0xa0202121, clock:0x2de90d59  2024/05/28 21:47:05

ofst[ 24]:err_code:0xa0202121, clock:0x2de90d59  2024/05/28 21:47:05

ofst[ 28]:err_code:0xa0202121, clock:0x2de90d59  2024/05/28 21:47:05

ofst[ 32]:err_code:0xa0202121, clock:0x2de90d59  2024/05/28 21:47:05

ofst[ 36]:err_code:0xa0202121, clock:0x2de90d59  2024/05/28 21:47:05

ofst[ 40]:err_code:0xa0202121, clock:0x2de90d59  2024/05/28 21:47:05

ofst[ 44]:err_code:0xa0202121, clock:0x2de90d59  2024/05/28 21:47:05

ofst[ 48]:err_code:0xa0202121, clock:0x2de90d59  2024/05/28 21:47:05

ofst[ 52]:err_code:0xa0802021, clock:0x2de90d5f  2024/05/28 21:47:11

ofst[ 56]:err_code:0xa0801002, clock:0x2de90d5f  2024/05/28 21:47:11

ofst[ 60]:err_code:0xa0002120, clock:0x2de90d60  2024/05/28 21:47:12

Having 342 days of uptime makes me think it is because of the tokins, but all those 2121 errors keep me thinking is something else...
Thanks in advance.
 
Hey all. I have another CECHK01 model, with about 11 days of uptime when it came to me! After replacing the thermal paste, it acted like GLOD. After a few more attempts, it beeped three times and shut down. If I wait a while, it comes back on and is like GLOD. When I have this system hooked up to read from the syscon, it makes the script hang after the "ylod"

Here are the codes I was able to get:
Code:
>$ ERRLOG GET 00
00000000 A08014FF 2DF7E0D6
Press Ctrl+C to exit
>$ ERRLOG GET 01
00000000 A0801701 2DF7E0D6
Press Ctrl+C to exit
>$ ERRLOG GET 02
00000000 A0232102 2DF7D7F5
Press Ctrl+C to exit
>$ ERRLOG GET 03
00000000 A0232102 2DF7D7F1
Press Ctrl+C to exit
>$ ERRLOG GET 04
00000000 A0232102 2DF7D7ED
Press Ctrl+C to exit
>$ ERRLOG GET 05
00000000 A0232102 2DF7D7E9
Press Ctrl+C to exit
>$ ERRLOG GET 06
00000000 A0902120 2DF7D77B
Press Ctrl+C to exit
>$ ERRLOG GET 07
00000000 A0403034 2DF7D77B
Press Ctrl+C to exit
>$ ERRLOG GET 08
00000000 A0404002 2DF7D77B
Press Ctrl+C to exit
>$ ERRLOG GET 09
00000000 A0902120 2DF7D775
Press Ctrl+C to exit
>$

First time on a mullion syscon also! Anyway, I thought these errors are worrying, which is why I'm posting here. Is there anything I can try, or test? It would be a shame to lose such a low-mileage unit. If anyone has anything, please let me know!
 
I read Syscon and got error 3039, but there is not much described about this error, besides the beginning of the code being different from everything shown so far A052.

Model is CECHL04


Below is the full reading:
Code:
Press Ctrl+C to exit
>$ ERRLOG GET 00
00000000 A0523039 FFFFFFFF
Press Ctrl+C to exit
>$ ERRLOG GET 01
00000000 A05114FF FFFFFFFF
Press Ctrl+C to exit
>$ ERRLOG GET 02
00000000 A0523039 FFFFFFFF
Press Ctrl+C to exit
>$ ERRLOG GET 03
00000000 A05114FF FFFFFFFF
Press Ctrl+C to exit
>$ ERRLOG GET 04
00000000 A0523039 FFFFFFFF
Press Ctrl+C to exit
>$ ERRLOG GET 05
00000000 A05114FF FFFFFFFF
Press Ctrl+C to exit
>$ ERRLOG GET 06
00000000 A0523039 FFFFFFFF
Press Ctrl+C to exit
>$ ERRLOG GET 07
00000000 A05114FF FFFFFFFF
Press Ctrl+C to exit
>$ ERRLOG GET 08
00000000 A0523039 FFFFFFFF
Press Ctrl+C to exit
>$ ERRLOG GET 09
00000000 A05114FF FFFFFFFF
Press Ctrl+C to exit
>$ ERRLOG GET 0A
00000000 A0523039 FFFFFFFF
Press Ctrl+C to exit
>$ ERRLOG GET 0B
00000000 A05114FF FFFFFFFF
Press Ctrl+C to exit
>$ ERRLOG GET 0C
00000000 A0523039 FFFFFFFF
Press Ctrl+C to exit
>$ ERRLOG GET 0D
00000000 A05114FF FFFFFFFF
Press Ctrl+C to exit
>$ ERRLOG GET 0E
00000000 A0523039 FFFFFFFF
Press Ctrl+C to exit
>$ ERRLOG GET 0F
00000000 A05114FF FFFFFFFF
Press Ctrl+C to exit
>$ ERRLOG GET 10
00000000 A0523039 FFFFFFFF
Press Ctrl+C to exit
>$ ERRLOG GET 11
00000000 A05114FF FFFFFFFF
Press Ctrl+C to exit
>$ ERRLOG GET 12
00000000 A0523039 FFFFFFFF
Press Ctrl+C to exit
>$ ERRLOG GET 13
00000000 A05114FF FFFFFFFF
Press Ctrl+C to exit
>$ ERRLOG GET 14
00000000 A0523039 FFFFFFFF
Press Ctrl+C to exit
>$ ERRLOG GET 15
00000000 A05114FF FFFFFFFF
Press Ctrl+C to exit
>$ ERRLOG GET 16
00000000 A0523039 FFFFFFFF
Press Ctrl+C to exit
>$ ERRLOG GET 17
00000000 A05114FF FFFFFFFF
Press Ctrl+C to exit
>$ ERRLOG GET 18
00000000 A0523039 FFFFFFFF
Press Ctrl+C to exit
>$ ERRLOG GET 19
00000000 A05114FF FFFFFFFF
Press Ctrl+C to exit
>$ ERRLOG GET 1A
00000000 A0523039 FFFFFFFF
Press Ctrl+C to exit
>$ ERRLOG GET 1B
00000000 A05114FF FFFFFFFF
Press Ctrl+C to exit
>$ ERRLOG GET 1C
00000000 A0523039 FFFFFFFF
Press Ctrl+C to exit
>$ ERRLOG GET 1D
00000000 A05114FF FFFFFFFF
Press Ctrl+C to exit
>$ ERRLOG GET 1E
00000000 A0523039 FFFFFFFF
Press Ctrl+C to exit
>$ ERRLOG GET 1F

Any idea what it could be?
 
Last edited:
"I have a PS3 series 20 and have a NEC/Tokin connected to the RSX, NEC/Tokin with the series 0E108 totaling 4, if I replace it with NEC/Tokin series 0E128, can all NEC/Tokin series 0E108 be replaced with 0E128?"
 
hello,

I'm writing becouse I have got starnge issue, my cechc04 have had issue sycone code 1002, I have decided to replace nec tokin, however first try test on two another fat with the same symptomps (one cechl even no turn on, cechj the same issue as cechc04 cold start issue and turn off during stress)

>



First start with cechl, I have replaced one side 2 pcs - 100% succes run well run some games no issue
Than cechj replaed one side now console even no start (before 2 time start normaly) no yellow light only red blinking, I thought secon side issue replaced also the capacitors no change.

I left it for later.

Start with cechc04, replaced one side - turn on, all seams be ok no cold start issue started normally, run first game to make sterss coe cell and rsx, no issue worked well plaid about 30 min, try go out from game to start anothe one and here..........freeze can do nothing, power off.
Next game all was ok temperatures low near 68C and fun speed up....ok try go out the game the same situation, power off, turn on normally go to multiman hapend nothing black screan and freeze.
Ok, try disc game the blue light on and freeze, turn off to check bluray disc conection, all was ok.
Try turn on but only stright red blinking no yellow light.

Do you have idea what happened?
 
Does anyone know what would be causing these errors in the syscon? very short YLOD.

fst[ 64]:err_code:0xffffffff, clock:0x2ddb2102 2024/05/18 08:19:14
ofst[ 68]:err_code:0xa0022110, clock:0x2ddb2115 2024/05/18 08:19:33
ofst[ 72]:err_code:0xa0022110, clock:0x2ddb2125 2024/05/18 08:19:49
ofst[ 76]:err_code:0xa0022110, clock:0x2ddb212c 2024/05/18 08:19:56
ofst[ 80]:err_code:0xa0022110, clock:0x2ddb2134 2024/05/18 08:20:04
ofst[ 84]:err_code:0xa0022110, clock:0x2ddb2154 2024/05/18 08:20:36
ofst[ 88]:err_code:0xa0022110, clock:0x2ddb215b 2024/05/18 08:20:43
ofst[ 92]:err_code:0xa0022110, clock:0x2ddb2162 2024/05/18 08:20:50
ofst[ 96]:err_code:0xa0022110, clock:0x2ddb21cd 2024/05/18 08:22:37
ofst[100]:err_code:0xa0022110, clock:0x2ddb21d6 2024/05/18 08:22:46
ofst[104]:err_code:0xa0022110, clock:0x2ddb235a 2024/05/18 08:29:14
ofst[108]:err_code:0xa0022110, clock:0x2ddb235f 2024/05/18 08:29:19
ofst[112]:err_code:0xa0022110, clock:0x2ddb2374 2024/05/18 08:29:40
ofst[116]:err_code:0xa0022110, clock:0x2ddb2379 2024/05/18 08:29:45
ofst[120]:err_code:0xa0022110, clock:0x2ddb237c 2024/05/18 08:29:48
ofst[124]:err_code:0xa0022110, clock:0x2ddb2380 2024/05/18 08:29:52
ofst[ 0]:err_code:0xa0022110, clock:0x2ddb24d1 2024/05/18 08:35:29
ofst[ 4]:err_code:0xa0022110, clock:0x2dddf8a3 2024/05/20 12:03:47
ofst[ 8]:err_code:0xa0022110, clock:0x2dddf91e 2024/05/20 12:05:50
ofst[ 12]:err_code:0xa0022110, clock:0x2dfbc3aa 2024/06/12 02:25:46
ofst[ 16]:err_code:0xa0022010, clock:0x2e124a6d 2024/06/29 04:30:37
ofst[ 20]:err_code:0xa0022110, clock:0x2e124a73 2024/06/29 04:30:43
ofst[ 24]:err_code:0xa0022110, clock:0x2e124d59 2024/06/29 04:43:05
ofst[ 28]:err_code:0xa0093003, clock:0x2e124d6c 2024/06/29 04:43:24
ofst[ 32]:err_code:0xa0022110, clock:0x2e124d72 2024/06/29 04:43:30
ofst[ 36]:err_code:0xa0022110, clock:0x2e124d78 2024/06/29 04:43:36
ofst[ 40]:err_code:0xa0022110, clock:0x2e124d7b 2024/06/29 04:43:39
ofst[ 44]:err_code:0xa0022110, clock:0x2e124d7e 2024/06/29 04:43:42
ofst[ 48]:err_code:0xa0093003, clock:0x2e124de4 2024/06/29 04:45:24
ofst[ 52]:err_code:0xa0022110, clock:0x2e124de8 2024/06/29 04:45:28
ofst[ 56]:err_code:0xa0022110, clock:0x2e124deb 2024/06/29 04:45:31
ofst[ 60]:err_code:0xa0022110, clock:0x2e124dee 2024/06/29 04:45:34

Any help would be very much appreciated thanks, This is on a DIA-002 Board.
 
Hello... I just finished doing some tests on my console... these are the results:
PS3 fat - SEM-001 - without battery (exhausted)
-----------------------------
Code:
bringup
[SSM] state: 0000 -> 0101
Bringup Mode #0 (0xFF)
[SSM] ssmCb_OnStartingBePowOn() called.
[SSM] First Boot.
[SSM] Bringup mode : syspm_stat=00000000/00000000
[POWSEQ] PowerSeq_Setup called.
[SSM] state: 0101 -> 0201
[POWSEQ] AV Backend Setup
[SSM] state: 0201 -> 0102
[SSM] state: 0102 -> 0202
[SSM] state: 0202 -> 0103
[SSM] state: 0103 -> 0203
[SSM] ssmCb_BeforeBeOn() called.
[SSM] state: 0203 -> 0104
Psbd_SbTransMode_Half:0x20e7
Press Ctrl+C to exit
>$
[POWERSEQ] Error : BitTraining BE:RRAC:RX2:GLOBAL1:RX_STATUS
[SSM] state: 0104 -> 0304
[SSM] ssmCb_AfterBeOn2() called.
[SSM] PowSeq Fail : Detected !
[SSM] state: 0304 -> 0700
[POWSEQ] AV Backend Letup
[SSM] Shutdown mode : syspm_stat=00000000/00000000
[ERROR]: 0xa0404421
[ERROR]: 0xa0403034
[POWSEQ] PowerSeq_Letup called.
[SSM] state: 0700 -> 0600
(PowerOff State) (Fatal)

[mullion]$
Code:
ofst[  8]:err_code:0xffffffff, clock:0x152cd28f  2011/04/04 19:03:43
ofst[ 12]:err_code:0xa0403034, clock:0x152cd28f  2011/04/04 19:03:43
ofst[ 16]:err_code:0xa0404421, clock:0x195ca897  2013/06/25 19:14:31
ofst[ 20]:err_code:0xa0403034, clock:0x195ca897  2013/06/25 19:14:31
ofst[ 24]:err_code:0xa0404421, clock:0x195ca8a3  2013/06/25 19:14:43
ofst[ 28]:err_code:0xa0403034, clock:0x195ca8a3  2013/06/25 19:14:43
ofst[ 32]:err_code:0xa0404421, clock:0x195ca8ad  2013/06/25 19:14:53
ofst[ 36]:err_code:0xa0403034, clock:0x195ca8ad  2013/06/25 19:14:53
ofst[ 40]:err_code:0xa0404421, clock:0x195caa4b  2013/06/25 19:21:47
ofst[ 44]:err_code:0xa0403034, clock:0x195caa4b  2013/06/25 19:21:47
ofst[ 48]:err_code:0xa0404421, clock:0x195f30c2  2013/06/27 17:20:02
ofst[ 52]:err_code:0xa0403034, clock:0x195f30c2  2013/06/27 17:20:02
ofst[ 56]:err_code:0xa0404421, clock:0x195f30d1  2013/06/27 17:20:17
ofst[ 60]:err_code:0xa0403034, clock:0x195f30d1  2013/06/27 17:20:17
ofst[ 64]:err_code:0xa0404421, clock:0x195f30de  2013/06/27 17:20:30
ofst[ 68]:err_code:0xa0403034, clock:0x195f30de  2013/06/27 17:20:30
ofst[ 72]:err_code:0xa0404421, clock:0x195f3127  2013/06/27 17:21:43
ofst[ 76]:err_code:0xa0403034, clock:0x195f3127  2013/06/27 17:21:43
ofst[ 80]:err_code:0xa0404421, clock:0x195f3153  2013/06/27 17:22:27
ofst[ 84]:err_code:0xa0403034, clock:0x195f3153  2013/06/27 17:22:27
ofst[ 88]:err_code:0xa0404421, clock:0x195f315e  2013/06/27 17:22:38
ofst[ 92]:err_code:0xa0403034, clock:0x195f315e  2013/06/27 17:22:38
ofst[ 96]:err_code:0xa0404421, clock:0x195f316b  2013/06/27 17:22:51
ofst[100]:err_code:0xa0403034, clock:0x195f316b  2013/06/27 17:22:51
ofst[104]:err_code:0xa0404421, clock:0x195f31ab  2013/06/27 17:23:55
ofst[108]:err_code:0xa0403034, clock:0x195f31ab  2013/06/27 17:23:55
ofst[112]:err_code:0xa0404421, clock:0xffffffff
ofst[116]:err_code:0xa0403034, clock:0xffffffff
ofst[120]:err_code:0xa0404421, clock:0xffffffff
ofst[124]:err_code:0xa0403034, clock:0xffffffff
ofst[  0]:err_code:0xa0404421, clock:0xffffffff
ofst[  4]:err_code:0xa0403034, clock:0xffffffff
[mullion]$
Code:
becount
Bringup : 6831 times
Shutdown: 5080 times
Power-on: 216day 00hour 56min 36sec
[mullion]$
-----
Maybe it can be useful to someone, I know that error 3034 is serious, but maybe with this data something more can be done... THANK YOU
 
Typical RSX failure. 3034 goes along with 4421 and the BitTraining BE:RRAC:RX2:GLOBAL1:RX_STATUS error is triggered.

RSX 90nm bumps have gone bad. Can be BGA failure as well though I think that would result in a 1802 error, if I'm not mistaken.
 
Typical RSX failure. 3034 goes along with 4421 and the BitTraining BE:RRAC:RX2:GLOBAL1:RX_STATUS error is triggered.

RSX 90nm bumps have gone bad. Can be BGA failure as well though I think that would result in a 1802 error, if I'm not mistaken.
thanks for your prompt response.... I'm afraid it has no salvation... or does it have salvation?
 
Last edited:
In theory, it's fixable.

In practice... that's gonna be a long and potentially expensive road for a non-BC unit.

You have to replace the RSX with a 65nm or 40nm chip (CXD2982/2991 or CXD5300/5301), modify the RSX voltage generating circuit (for a lack of better wording - basically a few ICs that control the voltage fed into the RSX must be changed when swapping the 90nm RSXs for 65 or 40nm) then train the Syscon accordingly to accept the new RSX.

I personally would not recommend going that far for a unit that does not have the backwards compatibility hardware - especially since software emulation isn't perfect.
 
...should we still be using 3x 470uF caps when installing a 40nm on a frankenphat? or should we drop it to 330x3? with or without the extra MLCC caps? BC units

The frequency resopnse curve of the combined filter is affected by the choice in caps you make. 330 responds slightly differently. It'll work, but for convenience and because the extra capacitance (within reason) doesn't hurt, it's easier to use a blanket recommendation. Replace each tokin with 3x 470uF polymer caps. Can be TaPol or AlPol, but they need to be low ESR processor decoupling caps. That'll work by itself.

However the MLCCs bridge the gap and attempt to broaden the frequency response to account for the fact that the tokins were better at reaching and filtering higher frequiency noise/ripple that the TaPol and AlPol alone cannot. If you look at the way SONY transitioned to a polymer array, they used 470uF polymer (tantalum or Aluminum) AND 10uF/22uF MLCCs, in addition to the 0.1uF array. This is a more standard method nowadays. The proadlizers (tokins) were a neat product, but not necessary. And old now. As they die, we need an alternative filter array that fills the role they played. While Polymers alone will "work," it's not a 1:1 replacement. A fully populated Tantalizer is designed to be.

If you want to use 330uF, use 4x. That'll also "work."

Why my system got ylod'ed again and can i get rid of it with replacing more necs

And for my last question is there a way to get the error codes without the test pads (my console isn't running long me to open a web browser etc.)(I heard about some service ports or smth like that but i dunno where it is or even its eve usable?)

You'll have to repair the pads, connect to a VIA, or the syscon pins themselves.

As for what caused it to fail again, we'd just be guessing. You didn't post a picture of your work, so I'll assume your soldering wasn't great. A cold joint can cause that. If you didn't replace all the tokins, the bad apple could be spoiling the bunch. Or while disassembling/assembling you may have caused the MB to flex and break a connection. Like the BGA, or knocked off an SMD...we can't know without the codes.

BUT...If it YLOD is greater than 4s it's probably the caps (my guess).

...clean CECH-2001A slim...Ran great...130d run time. I tore it down, and delidded both chips...Now it doesn't start and I get the following in the error log:
# A0403034 2DD0C299
# A0404411 2DD0C298

I can see exposed copper where you likely nicked a trace. Can't see it well enough to be sure. And the paper towel is obscuring the view of the upper right hand corner. Those errors are common after a failed delid attempt and that model is not prone to them from natural causes.

Out of curiosity, what tool did you use to delid? I'd reccomend so modifications to prevent the chance it can dig into the substraite. Also, never start from the corners.

...can I replace my caps with a 1000uF and a 330uF TaPolys? I think the 1000uF one should be fine given its ESR/impedance characteristics (sorry, can't post a link yet, it's Kemet's) – I think they are very decent with an ESR of 5mOhm at critical freqs. The 330uF one is a bit worse at higher frequencies impedance-wise. I'd also throw in a couple of ceramic ones (100nF, 10uF, 20uF).

I just hope 3001 doesn't mean the board is dead ...

The 4x tokins have a combined ESR of 0.375 mOhms. As an EE I'm sure I don't need to remind you, but the reason you choose multiple 470uF low ESR (4.5 mOhms is ideal) is so that their combined ESR divides by the number of them used in parallel (4.5mOhms / 12 caps = 0.375 Combined ESR of the resulting filter array), so you can meet or exceed this specification. Using a single 1000uF of 5 mohms will result in a combined ESR that's higher than the original array. Which is within reason may not become an issue, it's just not recommended and those kemet caps are more expensive. Cost's more for worse performance.

...K with 423 days of uptime showing signs of NEC/TOKINs going bad. ...1002 codes...I have 330uf caps...Can I use these?
Use 4x 330uF per tokin replaced. Be aware that old caps that have been installed before may degrade when reflowed several times. Not such a big deal, but old caps from slims are still old. They can go bad too. New caps resets the timer.

...cechk04, DIA-002...gets a YLOD after a few seconds and turns off...I run bringup is a bunch of a0202121...
Code:
Power-on: 342day 17hour 51min 49sec

ofst[ 12]:err_code:0xa0202121, clock:0x2de90d59  2024/05/28 21:47:05
ofst[ 16]:err_code:0xa0202121, clock:0x2de90d59  2024/05/28 21:47:05
ofst[ 20]:err_code:0xa0202121, clock:0x2de90d59  2024/05/28 21:47:05
ofst[ 24]:err_code:0xa0202121, clock:0x2de90d59  2024/05/28 21:47:05
ofst[ 28]:err_code:0xa0202121, clock:0x2de90d59  2024/05/28 21:47:05
ofst[ 32]:err_code:0xa0202121, clock:0x2de90d59  2024/05/28 21:47:05
ofst[ 36]:err_code:0xa0202121, clock:0x2de90d59  2024/05/28 21:47:05
ofst[ 40]:err_code:0xa0202121, clock:0x2de90d59  2024/05/28 21:47:05
ofst[ 44]:err_code:0xa0202121, clock:0x2de90d59  2024/05/28 21:47:05
ofst[ 48]:err_code:0xa0202121, clock:0x2de90d59  2024/05/28 21:47:05
ofst[ 52]:err_code:0xa0802021, clock:0x2de90d5f  2024/05/28 21:47:11
ofst[ 56]:err_code:0xa0801002, clock:0x2de90d5f  2024/05/28 21:47:11
10x A0202120 / 1x A0801002/A0802120 error combo is a newer reported one, but I've seen it be the RSX tokins. It seems to be more common in nonBC phat models due to a difference in how they report error codes. Your's is a K model (DIA-002). I believe I saw this same error in one of my DIA-002's. I used it as a pretext to steal it's 65nm RSX, instead of replacing the tokins which were what likely killed it. For me the 65nm RSX is more valuable to harvest and use to revive a BC model, than reviving the DIA-002. Even though it's a relatively easy diagnosis.

In this case, the 2120's do not indicate the HDMI or DVE IC, because the number of A0202120's is always 10x and followed by a 1002. Check the timestamps, they occur in the same YLOD event.

This is going to be tokins IMO.
...CECHK01 model, with about 11 days of uptime...After replacing the thermal paste, it acted like GLOD. ...Here are the codes.
00000000 A08014FF 2DF7E0D6
00000000 A0801701 2DF7E0D6
00000000 A0232102 2DF7D7F5
00000000 A0232102 2DF7D7F1
00000000 A0232102 2DF7D7ED
00000000 A0232102 2DF7D7E9
00000000 A0902120 2DF7D77B
00000000 A0403034 2DF7D77B
00000000 A0404002 2DF7D77B
00000000 A0902120 2DF7D775
[/CODE]
"after replacing thermal paste" it developed an error typically associated with BGA defects in this model. Check your method of breaking the suction between the HS and IHS. If you flex the Motherboard too much during that separation, it can cause BGA defects. It scares me every time I have to do it, but I use a plastic spudger to try and lever it up from the side, where it causes the board to flex less.

Also check for drop damage and any signs the board has been tampered with. If a BGA defect is suspected I always find it's impossible to rule out user error. In other words, I usually find evidence it didn't happen naturally from thermocycling. Sealed, errors predate any separation of HS from IHS, etc...

Likely, the GPU needs reballed.
I read Syscon and got error 3039, but there is not much described about this error, besides the beginning of the code being different from everything shown so far A052.

Model is CECHL04


Below is the full reading:
Code:
00000000 A0523039 FFFFFFFF
00000000 A05114FF FFFFFFFF
Any idea what it could be?
Facinating! That is an as yet unreported code! A new code!!!

These things get me excited. BAsed on the step number it's likely related to byte-training. This is the process immediatly after Bit-training, which is what fails in the case of Processor failures, like BGA or Bumps defects in either the CPU/GPU. This is when the processors' high speed interface on the FlexIO, th traces connecting them, are fed a series of bits or bytes, and the expected timing and response is measured to calibrate the phase lock loop and bring the alignment to a center position. This improves high speed signal integrity.

When there's an issue, like a poor solder joint, or a worn out die, this calibration procedure has a harder time completing, and when it finally can't...BEEP...BEEP...BEEP! Byte training come immediately after bit training. BitTraining causes 3034/4xxx errors at step# 40. ByteTraining causes 3035 errors at step# 50, and finally we've seen 3041 at step #52. That's the closest to your 3039 we've seen. We never got a resolution to it, nobody knows what caused it. But I have a guess and your's may help with this...

Your 3039 is right at the last stage before the bootloader. Between ByteTraining at step number 50 and when the StarShip 2 controller loads the bootloader stored on flash at step 60. Issues with the Nand/Nor cause A0603040 errors. But botth you 3039 and that 3041 occured at a previous step number of 52. The 14FF is a clue as to what's going on. That's a Checkstop. This is usually a hardware error during a software check. It discovered a fatal HW error during a check, and stopped because of it.So the poignant questions are...
  1. What comes after ByteTraining has determined the CPU and RSX are calibrated and ready for action? (assuming they are)
  2. What critical HW is needed for the bootloader to start?
I speculate it may be the Flash or controller itself, the starship 2 controller, because it's similar to 603040, but occuring earlier. So perhaps 3041 is an issue with the StarShip 2 controller. And perhaps your 3039 is too. It gives you a place to look at anyway. However, this is not going to be easy to troubleshoot.

Could you please provide as much information about the condition of the console as you can? Like describe if it was sealed, dusty, has evidence of drop damage, water damage, what work have you done, or what work can you see was done. Like, does it have new thermal paste? Is there flux residue around the RSX/CPU/ Has it been delided? And so on.

"I have a PS3 series 20 and have a NEC/Tokin connected to the RSX, NEC/Tokin with the series 0E108 totaling 4, if I replace it with NEC/Tokin series 0E128, can all NEC/Tokin series 0E108 be replaced with 0E128?"
It's exceedingly difficult to replace the proadlizer with a proadlizer. You will have to subject the board to the same temperatures needed to reflow the solder under the processor. It's more adviasble to remove them and replace with polymer caps soldered to a carrier board like the Tantalizer, because it makes soldering easier and can be done using an Iron. It won't risk the BGA. That was my main objective in making designing it.
 
Last edited:
Back
Top