PS3 (Research/Experimental) - NEC/TOKIN Capacitors Replacement - YLOD

Frequently Asked Questions

What is the YLOD?

It look like this...

What Causes the YLOD?
I wrote a much more in depth post about it here. But to summarize, SONY's official position on the YLOD is that it's a 'general hardware failure that needs to be diagnosed.' That is the truest statement...PERIOD! There are many causes and no single solution. However, there's no shortage of people attempting to simplify the issue down to a single problem.

This is the best video on the subject I've seen and it's well worth a watch. But be sure to come back and read the rest of this post, because it's easy to misunderstand what he means.
Displaced Gamer's video is the best researched IMO. And while is sounds like he's getting caught up in the NEX/TOKIN hype, it only "sounds" like he's against the reballing narrative. That not the case! He was only making the claim that it's not the ONLY solution. That the tokins can also cause it! And that "we shouldn't assume."

People want there to be a simple fix. One that's been hiding under our nose all this time. That's the underdog narrative that sheeple eat up like cake. It's the junk food of human psychology. It tastes good, but the flavor is devoid of nutrients. It's wishful thinking! The tokin narrative that say's "the BGA is fine," that narrative sounds good, but "The cake is a lie!" We will abandon the truth and latch onto any easier explanation. If it seems less expensive, or if it looks easy enough to DIY. It doesen't matter what the truth is, we will perform mental gymnastics to make believe. ANYTHING that allows us to believe something easier.

So what is the truth?

YLOD causes:
  • ~3% = NEC/TOKINs
  • ~7% = Blown fuse, short resistor, diode, bad IC, HDD, LAN adapter and associated SMD's fried by lightning strike, etc.
  • <1% = CPU's BGA, usually because of botched repair attempts or a dropped console.
  • ~90% = RSX issues
    • Most of which are BGA defects requiring a reball!
    • Bad bumps, RAM, internal damage, account for a smaller percentage of the RSX issues.

The #1 issue on this thread is that Heat causes false positive! That's why it's so confusing for people. It's why using a hair drier works (temprarily). It's why the hot air test on tokins works (temporarily). And it's why replacing tokins works (temporarily). The heat from those processes causes a thermo-mechanical bond that make it appear as thought the console was repaired when it wasn't. A "false positive" or "false repair." The heat warped the board and the microscopic BGA defect or bumps reconnected physically. It's like holding 2 wires together instead of soldering them. That's not repair! When the strain relaxes, the YLOD will come back. Usually in 2 weeks. I performed a statistical analysis, compiling every post on this thread into a spreadsheet, showing that replacing tokins does not fix the YLOD in the majority of consoles. That is backed up by testimonials by professionals that do volume repairs and have a much better feel for the percentage of consoles having bad tokins. People who's opinion's I have come to trust!

I have identified the following categories of YLOD. These can help you identify if your console is likely to have bad tokins. But this is not diagnostic! You still need the SYSCON errorlog to confirm the diagnosis:
  1. Warm-Start (Only starts when Warm)
  2. Cold-Start (Only starts when Cold)
  3. Instant (<1s)
  4. Non-Instant (1-10s)
  5. Delayed (10s - 5min)
  6. Random (Normal stress)
  7. Intense (High Stress, same place in certain games)
#'s 1-3 are not caused by bad tokins. RSX issues, PSU, shorts, faulty IC's like MOSFET's or their controller, and blown fuses are the most common causes. The SYSCON errorlog can provide clues to help make a diagnosis, but even then it's not 100%.

#4 can be caused by many things, including physically damaged tokins, like a botched repair attempt or an accident. You need the SYSCON errorlog to narrow down the list. I had a 4.5s YLOD on a COK-001 MB that was struck right on the corner of an RSX tokin by a cylindrical object in shipping. I know for a fact that MB had bad tokins and a good BGA before shipping. It was reballed professionally by @squeept before he sent the MB to me! Before shipping it to me, he was getting an "Intense YLOD" with 1002 SYSCON error codes. He confirmed the bad tokins on an oscilloscope! I did a bunch of testing on that console confirming the error codes associated with bad tokins and characterizing the filter attenuation by removing tokins, testing, adding TaPol capacitor arrays, and retesting. That's how we know what SYSCON error codes indicate bad tokins!

#'s 5-7 rule out alot of thing that cause the YLOD. These may very well be caused by bad tokins. If you have this kind of YLOD and it's not caused by an overheating issue, then the chances of a bad tokin are much higher. Not 100%, however! You still need to dump the SYSCON errorlog and confirm a 1002 to diagnose the problem.

Are my NEC/TOKINs are bad?
The difinitive way to know is by using an Oscilloscope. Here is what confirmed bad RSX tokins look like (RSX = Blue, CELL = Yellow).
Bad Tokins.jpg

That console had a 4.5s YLOD and A0801002 SYSCON error codes.

For comparison, this is what working tokins look like...
Working Tokins.jpg

If you don't have an oscilloscope, that's fine. If the NEC/TOKINs are bad enough to cause a YLOD, the SYSCON will generate a 1002 error code. However, 1002 can be caused by more than just the tokins. There is a PWM Buck Controller and other surface mount devices involved in the filter. Any one of them could cause this code. However, replacing the tokins usually clears up the problem.

Is my YLOD caused by Overheating? Can I fix it by deliding?
Overheating is a different LED code. Here's what it looks like (volume warning, his intro will blast you)
The fan ramps up and it starts flashing yellow (repeatedly) before it shuts down. A YLOD only flashes once, then flashes red repeatedly thereafter. Even if you were to confuse the two, the SYSCON errorlog can easily diagnose an overheating CPU (1200).

Most PS3's are getting old at this point and their thermal past as well. Most CPU's need to be delidded at this point, because that thermal interface compound (TIC) is worn out by now in the majority of consoles. Even the higest quality TIC's recommend replacement every 5-7 years. SONY wasn't worried about the console beyond the warranty period. So that becomes our problem now.

Be sure you are very carefull about how you delid. There are tiny traces than can be ruined if you use the wrong tool! Please read this post about how to make a CPU Deliding tool.

What are the SYSCON errorlogs?

"The PS3 syscon is the main power controller chip of the PS3." Among other things, it essentially watches over the PS3 and when something bad happens, such as when a YLOD occurs, it records an error code. The errorlog can hold 32 of these error codes.

Thanks to this thread by @db260179 we now have a method to read the codes! This has become the most important first step to diagnosing what wrong with your PS3! It's by far the cheapest place to start if you plan to attempt to fix your PS3 yourself. I made tutorial for windows that's a bit more friendly (IMO) than the other tutorials. The tutorial linked to in the original post (on github) is for Linux and requires a few steps that weren't very clear.

There is also an executable GUIin the github repository that makes it easier, but it's usefulness is limited. Perhaps in the future it can be expanded to be the de facto way to access the SYSCON, but for now I recommend doing it the manual way using my tutorial.

Will the SYSCON error logs dignose my PS3?

We are still learning what each code means and how to use them to diagnose PS3 issues. The best thing to do is ask questions in the SYSCON thread and perhaps someone will be able to help you figure it out.

Having said that we do know 3034 means RSX/CPU issues (almost always RSX) and that 1002 means nec/tokins. That's maybe 93% of the of the issues right there. The other 7% have cryptic codes that give us clues to the issue, but you probably still have to troubleshoot the board to figure out what's wrong (probe voltages and resistance with a multimeter). That can be tedious, but the more time that goes by the better we'll understand the codes. It's still fairly new to us!

Can the SYSCON errorlogs tell me if reballing is necessary?

3034/4xxx errors indicate CPU/RSX issues that cannont be fixed by any other method we have found. If you have a 3034, with or without the associate data error (4xxx), you need a reball. A reflow can work too, but it wont last as long and both put the chip/MB through heat stress that's hard on them. It's better to reball and make it last, since this heating process can't be repeated very many times before the chips burn out!

On the other hand, if you see a 1002 error that means you have bad nec/tokins and a reball is unnecessary.

I got a 3034 error from the SYSCON. Will reballing fix my console 100%? I want to be sure before doing it!

I have thoroughly read every page of this thread and spoken with professional PS3 repair persons who work on PS3 consoles in volume. The story is the same. The YLOD is primarily an RSX issue! 90% of YLOD's are due to this 3034 error! If you perform the SYSCON and get a 3034, you can be sure it's not in the other 10% of unrelated problems! You need a reball. But, that doesn't always mean there's a BGA defect causing it your issue! And a reball only fixes the BGA!

So no, we can't be 100% sure it'll fix your YLOD. There could be bad bumps, broken traces that the CPU/RSX use to communicate, and the die could just be worn out. However, reballing is the only way to know for sure! Alot of the time it fixes the issue. If you can find a reputable repair shop that knows what they're doing, that is!

How many times can my console be reflowed/reballed?
The goal of reflowing/reballing should be to get a strong BGA bond the first time! If you are contemplating a reflow because it lookes easy enough to DIY, please manage your expectations!

Leaded solder balls provide more durability and you don't get that from reflowing. Also reflows reuse the same old oxidized lead free solder. The defect could have been exposed to the air, grease, grime, lint, dirt, and hair for many years. The oxidation can be in the cracks and crevasse, often the pad itself. If the pad is oxidize you probably wont even get the solder to stick to it! Or if it does, it'll be brittle and break off easily again.

There are things you can do to improve your reflow, such as pre cleaning the board thoroughly to remove the grime and use good reflowing equipment under inert gas. But oxidized pads are impossible to prep chemically. Flux can help, but you really need to remove the chip and physically clean the pad to get them preped properly. So even the best reflow will not be as good as a reball.

Thermal stress is hard on processors and the fewer times you put the die through reflow temperatures the better. It was already through one reflow at the factory. That's fine, it's designed for it. But each time after that the these temperatures harm the chip and motherboard. If you have to subject the chip to these extreme temperatures is should be done right and not be needed again! A reflow might get you back up for a year, but then you'll need another one. How many times can you do this?

In my experiance, not more that a couple 2-3 times before the chip dies or the BGA pads on the motherboard start lifting off the board. Reflowing is just a bad idea. I have personally practiced reballing on a donor motherboard and after about 3 reflow cycles the BGA pads on the motherboard begin breaking off! Also the layers can start to de-laminate. Clearly, this cannot be repeated indefinitely! Like I said, the goal is to get it repaired once and enjoy it while it lasts. If it dies again, it's better to replace the chip. It could be that the reason it died again is not the BGA, but the die or bumps. Replacing the chip saves the motherboard the stress of another pointless reflow cycle. So my answer to the question of how many times you can reflow/reball is once! After that your best option is to replace the chip! Beyond those 2 thermal cycles, repeated attempts are likely to cause permanent damage to the motherboard. A skilled repair shop might be able to salvage the console another time or 2, but the motherboard is on it's last leg.

And don't be the guy that reflow's a backwards compatible PS3 just to resells it as working! That's such a dick move! First how would you like to buy a DIY reflow for $300 just to have it YLOD a year later? Come on man, pay it forward! There are only about 6 million backwards compatible consoles out there and doing this increases the likelihood that console will not be able to have successful reball in the future. That's if you don't irreparably damage it in the process! It's better to sell it to someone willing to reball it. For the sake of video game preservation! These consoles deserve to be enjoyed for as long as possible.

My YLOD turned into a GLOD! What does that mean?
BGA defects can cause either a GLOD or a YLOD. I have seen a YLOD turn into a GLOD after a tokin replacement, due to thermomechanical false positives. The BGA pad that were causing the YLOD reconnected from thermal warping, but there was still a break that caused the GLOD. Or you could change the mounting ptessure when disassembling or installing fresh thermal paste. The solder cracks are microscopic, so it doesn't take much.

There are other issues that can cause a GLOD, such as GPU/CPU RAM, OS corruption, software problems, and a handfull of power ICs. But usually these would be a GLOD always, not change from a YLOD to a GLOD. However, @botakompong's method of troubleshooting PS3's can help you diagnose that type of ambiguous fault.

My console died 1 year after having it reballied? Why?
Many reballers are shady. I've heard of people paying for a leaded reball and only getting a reflow! And some don't even clean the Motherboard afterwards, leaving flux everywhere. Very unprofessional! I'd bet that many of the "reballed" consoles out there weren't actually reballed. And even if they were they may not have been done well.

Is reballing with lead solder balls permanent?
The same problem that caused your solder balls to crack in the first place can cause it again. Anyone telling you otherwise is using "puffery" to sell you on the idea. Yes, leaded solder is more durable than lead-free, but eventually leaded solder will deform and crack too. It's about thermal cycles. Leaded balls can withstand more, that's all.

There are a lot of factors that go into getting a strong bond, such as the condition of the motherboard, drying, prepping the BGA properly, the reflow profile, cleaning, etc. Since SONY is not offering these services anymore, the quality of the reball you will get is inconsistent. That's probably why there are so many complaints that reballs fail.

Let's say I get a good reball from a trusted shop. Will that be permanent?
No! Yes, it'll last longer than it would have otherwise. However, permanent is a relative term. How long is permanent? For the rest of your natural life? If that's how you define "permanent," then no electronic device would qualify. We tend to think of electronics as indestructible, but they are perishable. Capacitors degrade, hard drives fail, SSD's only have so many read/write cycles, processors degrade, solid state laser wear out, disc's rot, cartridges wear out, batteries die, metals rust, etc. But that's probably not what you meant.

You probably mean, "will it remove the defect that cause PS3's to fail prematurely?" That issue is with Modern CPU/GPU's. Namely, that they run hot. The more the temperature swings, the more strain this places on the chip. Eventually the solderballs deform, crack, or pull away from their pads. This is avoidable! FCBGA isn't the only technology that can be used on modern processor. It's the most profitable. Unfortunately, the electronics industry has chosen to use Flip Chip Ball Grid Array (FCBGA) to connect CPU/GPU's in consumer electronics, such as laptops and video game consoles. It's cheaper than using sockets and it prevents you from upgrading/repairing your own device. It's an anti-consumer/anti-competitive practice that should be illegal, because it increases repair costs, puts small 3rd party repair shops out of business, and increases e-waste. Every country NEEDS to stay technologically competitive, so electronics companies have massive leverage and use it to obtain favorable legal decisions. Such as squashing "right to repair" laws, for example. "If you don't like it, fine! No SONY products for you!" That's a bigger social problem.

It's not all the BGA either. A socket provides strain relief. That's why your cpmputers CPU lasts longer than your graphics card's GPU, which is soldered on with a BGA. However, even socketed CPU fail. Again, that's a function of heat. The more of it, the faster it dies. Maybe you get ten years out of it. Cool, just buy another. That only works when it's in a socket and the part is available. We don't have that with a PS3. When the die wears out, that chip has to come off. Inthe case of the PS3's GPU (RSX), it can be replaced. But the Cell cannot (not easily).

Since a reball is not an option for me (or most of us), is there a guide that shows the proper reflow procedure?

It's probably easier to start with the wrong way to do it:
What she did wrong:
  • You want to use AMTEC-NC559 Flux. This is designed with SAC (Lead Free alloy) only! It will not work for lead alloys! The AMTECH flux is to reflow the Lead Free solder or remove the chip only.
  • If you are reballing with leaded solder balls, you need a different flux to put the chip back on! I like the KINGBOW BGA flux for that. It came with a cheap BGA reballing kit I bought on ebay. It might be cheap, but it's great flux. However, if you're just doing a reflow, you want the AMTECH flux!
  • The MB needs to be thoroughly cleaned before and after the reflow. The area underneath the RSX is especially important to clean, as contamination will get into the BGA and reduce the durability of the reflow!
  • Als flux residues can actually be slightly conductive, which reduced resistance between VDD/GND on the BGA. And since thos resistance is alread very small for processors, it can be significant. I've measured 2.4 ohms after a reball, then 3.1 after cleaning the flux off. If your RSX isn't as healthy as mine and is closer to 2, then the flux residue could lower it to 1ish, and that may not be enough separation for it to work!
  • Preheating the board as she demonstrates is not going to work. Moisture from humidity in the air has seeped inside. It's trapped microscopic pores between motherboard layers, SMD components, underfill of GPU, CPU, RAM, everywhere. You MUST thorougly dry the motherboard or the steam will force it's way out explosively. This is called popcorning! Ive seen component pop and crack! I've seen streams of bubbling liquid escaping out from the underfill of the the RSX DIE! This can squeeze the solder bumps out!
  • The PS3 motherboard needs to be dried at 100C for 2-4 hours before a reflow! NO EXCEPTION!
  • A bottom heating element is essential to prevent the motherboard from warping. When you apply heat on one side of the motherboard it bows. In this case the topside expands, and since the bottom side isn't heated, it's cooler and expands less. So the motherboard literally warps! When it cools, it contracts and pulls on the freshly solidified BGA. So even if you achieve a good reflow, the stress of this bowing back can literally rip the BGA pads off the motherboard or break the solder from them! It can delaminate the layers of the board too.
  • You need a bottom heater to preheat the entire motherboard and control the cooldown so that both sides heat-up and cool-down together, preventing thermal warping. This will achieve a stress free bond.
  • She says to set the heat gun to a certain temperature and to heat for a set period of time. This is a terrible way to estimate temperature. Some boards may flow, some may not! There is no way to know for sure if you even melted the solder. And if you did, there's no way to know if you burned it to a crisp in the process!
  • You need temperature probes to help you reach the proper reflowing temperatures to melt SAC lead free alloy (218C). Reflow profiles, like the one pictured below, are designed to have a preheat ramp, reflow, and cooldown period that minimize crystal formation.
    tacky-assembly-flux-12-728.jpg
  • Profiles control the temperature carefully to achieve a strong joint. The way she did it was THE WORST way you could go about it. Even if it worked, the solder balls that formed will be full of long crystals that make it brittle and prone to cracks! Solder that is shiny has small crystals. If it appears matte, it has long crystals and is hard/brittle. They longer you hold the heat on the solder, the more time they have to grow, and the more time the solder has to oxidize, which further makes it hard/brittle. ESPECIALLY lead Free alowy! IFixIt's method will not last very long before the YLOD will return.

This is the guy/video that inspired my DIY ghetto setup. It's capable with practice...

A reflow is the same procedure, except instead of removing the chip you just begin the cooling cycles after nudging to ensure it has flowed. The setup cost me about $300, but it's not ideal, because I have to manually simulate the reflow profile. I'm currently looking into making my own automated system using an IR6500 top heater, relays, and PID controllers...

However, a complete BGA rework station is probably cheaper overall when I add up all the different pieces. So if you haven't already gone down the path of buying rework equipment, then a dedicated BGA rework station is the easier solution. The reason I've gone this route is that the different pieces in my setup are useful for other electrics projects, besides reballing.

I realize these tools are expensive and that no one wants to spend more on tools than another PS3! This is why reballing services actually aren't asking too high a price at $100-150. The problem is finding one you can trust, if you can find one at all!

Okay, but that sounds expensive! What's the bare minimum I need to spend to do a good enough reflow?
  1. 99% Isoproply alcohol
  2. AMTECH FLUX
  3. A temperature controlled Hot Air wand
  4. An IR Preheater
  5. Thermometer with 2 K-type probes.
  6. Experiance!!!
That's going to cost you around $200 minimum! If you try the "heat gun special" like that IFixIt video, you are more likely to damage the console than you are to fix it. And even if it does work, it won't for long. Worst of all, that method will reduce the chances a future repair can fix the console. So the console will likely become e-waste in a few months when the traces rip from the pads, or the layers delaminate, etc.

Why doesn't the CPU (Cell) fail?

It does!

Why does the GPU (RSX) fail more often than the CPU (Cell)?
Only Nvidia and SONY know for sure. They have access to CAD tools that can model and simulate the stresses on the motherboard/CPU/RSX interface. These tools can predict how many thermal cycles it takes to develop a BGA crack, for example. This is rutine work done by enginners during the design process and can be adjusted then. For example, they can increase the fan curve to be more aggressive if more cooling is needed to prevent premature failure. Or they can redesign the heat sink, use more powerful fan, different thermal compound, etc. And they have to weight other considerations. They often prioritize fan noise over thermals, for example. People will just criticize the system for being too loud, instead of thanking them for the extra reliability. This is happening with the PS5 right now.

While we don't have access to the CAD tools SONY and Nvidia do, we speculate that the RSX undergoes more thermal cycles, because of it's workload, than does the Cell. While maximum temperature is important, the change in temperature (Delta T) is even more important! On a well cooled stock backwards compatible PS3, the CPU often runs 69-75C. Yes it changes from 25-70C when you turn it on and from 75C-25C when you turn it off, but that only happen once per session. Most of the time, the CPU is running at a constant temperature in the 70s, not changing up/down all that much.

By contrast, the RSX can run much cooler when it's not being used much and hotter when used heavily. Say, 63-79C with many more swings up and down as you play a game. Because of this, it experiences many more thermal cycles, which over time deform the BGA and bumps. Also, that heat wears the die out faster.

For more information about how SONY knew (or should have known) about these issues, check out this post and this one where I break it down further. The ensuing discussion is also entertaining.

What happens when my RSX dies? Can I get a new one installed?
Any RSX can be replaced with the same model, assuming the replacement works. New old stock is the limiting factor. All the NOS RSX's for backwards compatible PS3's were bought up and installed long ago. So you aren't going to get a "new" one installed. You can get a used one, but they are designed to be subjected to reflow temperature only once (when installed at the factory). They can be removed, ohm tested, and reinstalled into another machine. This does work well, however it certainly damages the RSX some. Now, that doesn't mean you can't get another ten years out of it, if done properly, and kept nice an cool afterwards. With leaded solder and fresh high quality thermal paste it can last a long time. Probably longer than it did stock! However, finding "new" old stock would definitely be better, because you're installing it for the first time (how it was designed).

So finding replacements has been the issue. That is until about 5 years ago! See the next question for details...,

What's this Frankenstein Mod?
About a year ago we learned about official SONY refurbished consoles that had different model GPU's installed on them. So far 4 console we know about have been found. Apparently, around 2012 SONY repair shops began replacing the stock 90nm RSX on backward compatible PS3's with 65nm and 40nm RSX's. Around that time the slim consoles were being manufactured and sourcing new old stock 90nm RSX's became impossible. So to repre these console's SONY had to figure out a way to install the newer 65nm, and later 40nm, RSX chips. We called this the "Frankenstein Mod" for obvious reasons and have been attempting to replicate it since 2020, without success. It involves replacing/reprograming the SYSCON chip itself (the part we're still figuring out), among other mods. So it's very difficult and time consuming.

However, 4-5 years before we even learned this was possable, @botakompong's brother "Kiaw" made a modchip that allows it! He and @botakompong could interchange 90nm, 65nm, and 40nm RSX's by just installing a chip and moving a few resistors! That wasn't possible before, because the console would recognize there wasn't a 90nm chip and refuse to boot (YLOD). The mod chip spoofs the RSX_ID and tricks the system into booting. An amazing accomplishment!

Unfortunately, Kiaw passed away a few year ago and we never got to pick his talented brain about the modchip. His brother @botakompong has carried on the torch selling the modchip and installing it at his shop in Jakarta, Indonesia. @botakompong had been offering this service for years before we found out about it in the west! We only learned about it last year. The modchip is only available through forwarding services ATM, making it hard to get. On fiverr I used "lusianaliu" who bought the modchip from this listing and shipped them to me. It's a bit inconvenient, but works.

@botakompong has alot of experiance with the mod and repiring consoles in general, so he's a wealth of knowledge, but he's not the engineer Kiaw was. We're still trying to regain the knowledge that was lost when Kiaw died. The specifics about how the mod works are not fully understood (by us at least).

Regardless, this mod chip enables us to replace the 90nm RSX with a much more reliable 65nm or even 40nm model. The 40nm RSX has no trouble staying at 54C in my A model, under intense load (ambient 25C)! So this is the ultimate solution! We can still source NOS 40nm RSX's! And there are many, MANY more Slim PS3's than there are BC models. This modchip makes sourcing replacement RSX's for BC PS3's sustainable. Every single BC model could easily get a 40nm RSX, since there are so many more slims out there! We don't have to throw them away anymore!
 
Last edited:
Frequently Asked Questions

Nice effort, dude. Needs posted at the top of every new page at this point. I still say this thread needs either removed or edited by the mods, OP ain't touching the post. I saw something somewhere (reddit?) recently where he still stood by the 90% lie, but I think he at least actually ceded the "use the syscon" point. I mean, I had to trap him in to admitting he didn't have a capacitance meter, and It's kind of nuts at this point that this much misinformation is still allowed so prominently here.

The only thing I'd clarify is that since no professionals (that work on these) have an x-ray machine, there's no way to verify that people didn't accidentally fix bad bumps when they do any rework. Inspection scopes/mirrors only show the outer rows, and while that's most likely where there would be any ball defects, it's far from definitive. And with my estimate of the prevalence of compensated, mechanically connected BGA issues being damn near 100% on un-fixed BC consoles at this point, you're going to see a BGA defect even if it's not the thing causing the problem. Not really a point that matters in any realistic way, but I'm a pedant. I certainly don't care if I accidentally fixed a bump when I sell them, as reliability after stress testing is damn decent. I wouldn't warranty them for a full year if I didn't trust the work. I like to blame any testing failures on bumps, but that's just a guess. My understanding of electromigration and other actual silicon defects is that rework would rarely have any impact, and it would never make it past real stress testing.

I've had that Rossman video thrown in my face more times than I can count. I like the dude, I really do, but he's not always right. And since he doesn't have any real background in the subject, there's a lot of nuanced problems that he struggles with. "It's never the coil" except it's pretty often the coil in things I specifically deal with....so.... then when he's talking about "reballing flip chips is bullshit" he's talking about Apple, because that's what he works on. He's talking about how it's dumb to reball one specific chip in his niche specialty, especially when there are replacements available because it's not a custom chip. One specific chip can have production issues with the bumps and be WAY more prone to failure than any other similar chip. Remember the capacitor plague around 2005 or so? EVERYTHING had bad capacitors. Or Sega stuff from the early 90's. Otherwise? I rarely change caps unless they actually, measurably failed. My Sony TA-2650 amp has perfect capacitors from 1975. Yeah, I'm not surprised if a cap next to a heatsink popped. Some SPECIFIC capacitors are problematic. I mean, the early fat model PS4 systems have BGA defects FROM THE FACTORY because something went wrong in production. Bad inert gas was my guess for the lawsuit that never happened. And they made it past QC. Clean off the oxidation then reball, and they hardly ever come back for warranty repairs.

I forgot what my point was. I'm getting a root canal in the morning. Brush your teeth.
 
Last edited:
Frequently Asked Questions

What is the YLOD?


What Causes the YLOD?
SONY's official position on the YLOD is that it's a 'general hardware failure that needs to be diagnosed.' That is the truest statement...PERIOD! There are many causes and no single solution. Howver there's no shortage of people attempting to simplify the issue down to a single problem. This is the best video on the subject I've seen and it's well worth a watch. But be sure to come back and read the rest of this post, because it's easy to misunderstand what he means.
Displaced Gamer's video is the best researched IMO. And while is sounds like he's getting caught up in the NEX/TOKIN hype, it only "sounds" like he's against the reballing narrative. That not the case! He was only making the claim that it's not the ONLY solution. That the tokins can also cause it! And that "we shouldn't assume."

People want there to be a simple fix. One that's been hiding under our nose all this time. That's the underdog narrative that sheeple eat up like cake. It's the junk food of human psychology. It tastes good, but the flavor is devoid of nutrients. It's wishful thinking! The tokin narrative that say's "the BGA is fine," that narrative sounds good, but "The cake is a lie!" We will abandon the truth and latch onto any easier explanation. If it seems less expensive, or if it looks easy enough to DIY. It doesen't matter what the truth is, we will perform mental gymnastics to make believe. ANYTHING that allows us to believe something easier.

So what is the truth?

YLOD causes:
  • ~3% = NEC/TOKINs
  • ~7% = Blown fuse, short resistor, diode, bad IC, HDD, LAN adapter and associated SMD's fried by lightning strike, etc.
  • <1% = CPU's BGA, usually because of botched repair attempts or a dropped console.
  • ~90% = RSX issues
    • Most of which are BGA defects requiring a reball!
    • Bad bumps, RAM, internal damage, account for a smaller percentage of the RSX issues.

The #1 issue on this thread is that Heat causes false positive! That's why it's so confusing for people. It's why using a hair drier works (temprarily). It's why the hot air test on tokins works (temporarily). And it's why replacing tokins works (temporarily). The heat from those processes causes a thermo-mechanical bond that make it appear as thought the console was repaired when it wasn't. A "false positive" or "false repair." The heat warped the board and the microscopic BGA defect or bumps reconnected physically. It's like holding 2 wires together instead of soldering them. That's not repair! When the strain relaxes, the YLOD will come back. Usually in 2 weeks. I performed a statistical analysis, compiling every post on this thread into a spreadsheet, showing that replacing tokins does not fix the YLOD in the majority of consoles. That is backed up by testimonials by professionals that do volume repairs and have a much better feel for the percentage of consoles having bad tokins. People who's opinion's I have come to trust!

I have identified the following categories of YLOD. These can help you identify if your console is likely to have bad tokins. But this is not diagnostic! You still need the SYSCON errorlog to confirm the diagnosis:
  1. Warm-Start (Only starts when Warm)
  2. Cold-Start (Only starts when Cold)
  3. Instant (<1s)
  4. Non-Instant (1-10s)
  5. Delayed (10s - 5min)
  6. Random (Normal stress)
  7. Intense (High Stress, same place in certain games)
#'s 1-3 are not caused by bad tokins. RSX issues, PSU, shorts, faulty IC's like MOSFET's or their controller, and blown fuses are the most common causes. The SYSCON errorlog can provide clues to help make a diagnosis, but even then it's not 100%.

#4 can be caused by many things, including physically damaged tokins, like a botched repair attempt or an accident. You need the SYSCON errorlog to narrow down the list. I had a 4.5s YLOD on a COK-001 MB that was struck right on the corner of an RSX tokin by a cylindrical object in shipping. I know for a fact that MB had bad tokins and a good BGA before shipping. It was reballed professionally by @squeept before he sent the MB to me! Before shipping it to me, he was getting an "Intense YLOD" with 1002 SYSCON error codes. He confirmed the bad tokins on an oscilloscope! I did a bunch of testing on that console confirming the error codes associated with bad tokins and characterizing the filter attenuation by removing tokins, testing, adding TaPol capacitor arrays, and retesting. That's how we know what SYSCON error codes indicate bad tokins!

#'s 5-7 rule out alot of thing that cause the YLOD. These may very well be caused by bad tokins. If you have this kind of YLOD and it's not caused by an overheating issue, then the chances of a bad tokin are much higher. Not 100%, however! You still need to dump the SYSCON errorlog and confirm a 1002 to diagnose the problem.

Is my YLOD caused by Overheating? Can I fix it by deliding?
Overheating is a different LED code. The fan ramps up and it starts flashing yellow (repeatedly) before it shuts down. A YLOD only flashes once, then flashes red repeatedly thereafter. Even if you were to confuse the two, the SYSCON errorlog can easily diagnose an overheating CPU (1200).

Most PS3's are getting old at this point and their thermal past as well. Most CPU's need to be delidded at this point, because that thermal interface compound (TIC) is worn out by now in the majority of consoles. Even the higest quality TIC's recommend replacement every 5-7 years. SONY wasn't worried about the console beyond the warranty period. So that becomes our problem now.

Be sure you are very carefull about how you delid. There are tiny traces than can be ruined if you use the wrong tool! Please read this post about how to make a CPU Deliding tool.

What are the SYSCON errorlogs?

"The PS3 syscon is the main power controller chip of the PS3." Among other things, it essentially watches over the PS3 and when something bad happens, such as when a YLOD occurs, it records an error code. The errorlog can hold 32 of these error codes.

Thanks to this thread by @db260179 we now have a method to read the codes! This has become the most important first step to diagnosing what wrong with your PS3! It's by far the cheapest place to start if you plan to attempt to fix your PS3 yourself. I made tutorial for windows that's a bit more friendly (IMO) than the other tutorials. The tutorial linked to in the original post (on github) is for Linux and requires a few steps that weren't very clear.

There is also an executable GUIin the github repository that makes it easier, but it's usefulness is limited. Perhaps in the future it can be expanded to be the de facto way to access the SYSCON, but for now I recommend doing it the manual way using my tutorial.

Will the SYSCON error logs dignose my PS3?

We are still learning what each code means and how to use them to diagnose PS3 issues. The best thing to do is ask questions in the SYSCON thread and perhaps someone will be able to help you figure it out.

Having said that we do know 3034 means RSX/CPU issues (almost always RSX) and that 1002 means nec/tokins. That's maybe 93% of the of the issues right there. The other 7% have cryptic codes that give us clues to the issue, but you probably still have to troubleshoot the board to figure out what's wrong (probe voltages and resistance with a multimeter). That can be tedious, but the more time that goes by the better we'll understand the codes. It's still fairly new to us!

Can the SYSCON errorlogs tell me if reballing is necessary?

3034/4xxx errors indicate CPU/RSX issues that cannont be fixed by any other method we have found. If you have a 3034, with or without the associate data error (4xxx), you need a reball. A reflow can work too, but it wont last as long and both put the chip/MB through heat stress that's hard on them. It's better to reball and make it last, since this heating process can't be repeated very many times before the chips burn out!

On the other hand, if you see a 1002 error that means you have bad nec/tokins and a reball is unnecessary.

I got a 3034 error from the SYSCON. Will reballing fix my console 100%? I want to be sure before doing it!

I have thoroughly read every page of this thread and spoken with professional PS3 repair persons who work on PS3 consoles in volume. The story is the same. The YLOD is primarily an RSX issue! 90% of YLOD's are due to this 3034 error! If you perform the SYSCON and get a 3034, you can be sure it's not in the other 10% of unrelated problems! You need a reball. But, that doesn't always mean there's a BGA defect causing it your issue! And a reball only fixes the BGA!

So no, we can't be 100% sure it'll fix your YLOD. There could be bad bumps, broken traces that the CPU/RSX use to communicate, and the die could just be worn out. However, reballing is the only way to know for sure! Alot of the time it fixes the issue. If you can find a reputable repair shop that knows what they're doing, that is!

My YLOD turned into a GLOD! What does that mean?
BGA defects can cause either a GLOD or a YLOD. I have seen a YLOD turn into a GLOD after a tokin replacment, due to thermomechanical false positives. The BGA pad that were causing the YLOD reconnected from thermal warping, but there was still a break that caused the GLOD. Or you could change the mounting ptessure when disassembling or installing fresh thermal paste. The solder cracks are microscopic, so it doesn't take much.

There are other issues that can cause a GLOD, such as GPU/CPU RAM, OS corruption, software problems, and a handfull of power ICs. But usually these would be a GLOD always, not change from a YLOD to a GLOD. However, @botakompong's method of troubleshooting PS3's can help you diagnose that type of ambiguous fault.

My console died 1 year after having it reballied? Why?
Many reballers are shady. I've heard of people paying for a leaded reball and only getting a reflow! And some don't even clean the Motherboard afterwards, leaving flux everywhere. Very unprofessional! I'd bet that many of the "reballed" consoles out there weren't actually reballed. And even if they were they may not have been done well.

Is reballing with lead solder balls permanent?
The same problem that caused your solder balls to crack in the first place can cause it again. Anyone telling you otherwise is using "puffery" to sell you on the idea. Yes, leaded solder is more durable than lead-free, but eventually leaded solder will deform and crack too. It's about thermal cycles. Leaded balls can withstand more, that's all.

There are a lot of factors that go into getting a strong bond, such as the condition of the motherboard, drying, prepping the BGA properly, the reflow profile, cleaning, etc. Since SONY is not offering these services anymore, the quality of the reball you will get is inconsistent. That's probably why there are so many complaints that reballs fail.

Let's say I get a good reball from a trusted shop. Will that be permanent?
No! Yes, it'll last longer than it would have otherwise. However, permanent is a relative term. How long is permanent? For the rest of your natural life? If that's how you define "permanent," then no electronic device would qualify. We tend to think of electronics as indestructible, but they are perishable. Capacitors degrade, hard drives fail, SSD's only have so many read/write cycles, processors degrade, solid state laser wear out, disc's rot, cartridges wear out, batteries die, metals rust, etc. But that's probably not what you meant.

You probably mean, "will it remove the defect that cause PS3's to fail prematurely?" That issue is with Modern CPU/GPU's. Namely, that they run hot. The more the temperature swings, the more strain this places on the chip. Eventually the solderballs deform, crack, or pull away from their pads. This is avoidable! FCBGA isn't the only technology that can be used on modern processor. It's the most profitable. Unfortunately, the electronics industry has chosen to use Flip Chip Ball Grid Array (FCBGA) to connect CPU/GPU's in consumer electronics, such as laptops and video game consoles. It's cheaper than using sockets and it prevents you from upgrading/repairing your own device. It's an anti-consumer/anti-competitive practice that should be illegal, because it increases repair costs, puts small 3rd party repair shops out of business, and increases e-waste. Every country NEEDS to stay technologically competitive, so electronics companies have massive leverage and use it to obtain favorable legal decisions. Such as squashing "right to repair" laws, for example. "If you don't like it, fine! No SONY products for you!" That's a bigger social problem.

It's not all the BGA either. A socket provides strain relief. That's why your cpmputers CPU lasts longer than your graphics card's GPU, which is soldered on with a BGA. However, even socketed CPU fail. Again, that's a function of heat. The more of it, the faster it dies. Maybe you get ten years out of it. Cool, just buy another. That only works when it's in a socket and the part is available. We don't have that with a PS3. When the die wears out, that chip has to come off. Inthe case of the PS3's GPU (RSX), it can be replaced. But the Cell cannot (not easily).

Why doesn't the CPU (Cell) fail?

It does!

Why does the GPU (RSX) fail more often than the CPU (Cell)?
Only Nvidia and SONY know for sure. They have access to CAD tools that can model and simulate the stresses on the motherboard/CPU/RSX interface. These tools can predict how many thermal cycles it takes to develop a BGA crack, for example. This is rutine work done by enginners during the design process and can be adjusted then. For example, they can increase the fan curve to be more aggressive if more cooling is needed to prevent premature failure. Or they can redesign the heat sink, use more powerful fan, different thermal compound, etc. And they have to weight other considerations. They often prioritize fan noise over thermals, for example. People will just criticize the system for being too loud, instead of thanking them for the extra reliability. This is happening with the PS5 right now.

While we don't have access to the CAD tools SONY and Nvidia do, we speculate that the RSX undergoes more thermal cycles, because of it's workload, than does the Cell. While maximum temperature is important, the change in temperature (Delta T) is even more important! On a well cooled stock backwards compatible PS3, the CPU often runs 69-75C. Yes it changes from 25-70C when you turn it on and from 75C-25C when you turn it off, but that only happen once per session. Most of the time, the CPU is running at a constant temperature in the 70s, not changing up/down all that much.

By contrast, the RSX can run much cooler when it's not being used much and hotter when used heavily. Say, 63-79C with many more swings up and down as you play a game. Because of this, it experiences many more thermal cycles, which over time deform the BGA and bumps. Also, that heat wears the die out faster.

For more information about how SONY knew (or should have known) about these issues, check out this post and this one where I break it down further. The ensuing discussion is also entertaining.

What happens when my RSX dies? Can I get a new one installed?
Any RSX can be replaced with the same model, assuming the replacement works. New old stock is the limiting factor. All the NOS RSX's for backwards compatible PS3's were bought up and installed long ago. So you aren't going to get a "new" one installed. You can get a used one, but they are designed to be subjected to reflow temperature only once (when installed at the factory). They can be removed, ohm tested, and reinstalled into another machine. This does work well, however it certainly damages the RSX some. Now, that doesn't mean you can't get another ten years out of it, if done properly, and kept nice an cool afterwards. With leaded solder and fresh high quality thermal paste it can last a long time. Probably longer than it did stock! However, finding "new" old stock would definitely be better, because you're installing it for the first time (how it was designed).

So finding replacements has been the issue. That is until about 5 years ago! See the next question for details...,

What's this Frankenstein Mod?
About a year ago we learned about official SONY refurbished consoles that had different model GPU's installed on them. So far 4 console we know about have been found. Apparently, around 2012 SONY repair shops began replacing the stock 90nm RSX on backward compatible PS3's with 65nm and 40nm RSX's. Around that time the slim consoles were being manufactured and sourcing new old stock 90nm RSX's became impossible. So to repre these console's SONY had to figure out a way to install the newer 65nm, and later 40nm, RSX chips. We called this the "Frankenstein Mod" for obvious reasons and have been attempting to replicate it since 2020, without success. It involves replacing/reprograming the SYSCON chip itself (the part we're still figuring out), among other mods. So it's very difficult and time consuming.

However, 4-5 years before we even learned this was possable, @botakompong's brother "Kiaw" made a modchip that allows it! He and @botakompong could interchange 90nm, 65nm, and 40nm RSX's by just installing a chip and moving a few resistors! That wasn't possible before, because the console would recognize there wasn't a 90nm chip and refuse to boot (YLOD). The mod chip spoofs the RSX_ID and tricks the system into booting. An amazing accomplishment!

Unfortunately, Kiaw passed away a few year ago and we never got to pick his talented brain about the modchip. His brother @botakompong has carried on the torch selling the modchip and installing it at his shop in Jakarta, Indonesia. @botakompong had been offering this service for years before we found out about it in the west! We only learned about it last year. The modchip is only available through forwarding services ATM, making it hard to get. And it's still a bit experimental. @botakompong has alot of experiance with the mod and repiring consoles in general, so he's a wealth of knowledge, but he's not the engineer Kiaw was. We're still trying to regain the knowledge that was lost when Kiaw died.

We know for sure the mod chip works on COK-002 motherboards! So if you have C/E model you're good to go! We're not sure about COK-001 yet (A/B models). I have a 40nm RSX on a Fully Backwards Compatible A model myself. It plays PS1 and PS3 games fine. However, it's not playing PS2! Kinda defeats the purpose for me, but we still don't know if that's an issue with the mod or my console specifically. So far, I'm the only one who has successfully attempted it on a COK-001 MB, at least that we know about. Once @botakompong is able to get back to the shop (he's been on covid lockdown), he'll start testing to see if he can replicate the issue. We just need it installed on more COK-001's to know if this is an isolated issue!

Regardless, this mod chip enables us to replace the 90nm RSX with a much more reliable 40nm. The 40nm RSX has no trouble staying at 54C in my A model, under intense load! If it turns out that the PS2 issue is just due to faulty HW on my console, and not an issue with the modchip, then this is the ultimate solution! We can still source NOS 40nm RSX's! And there are many, many, more Slim PS3's than BC models. This modchip makes sourcing replacement RSX's for BC ps3 possible. We don't have to throw them away anymore!
That's a lot of work. Well done.

Though I'm afraid that this won't reach those that actually need this information... The way things are right now. But oh well, maybe things can change in the future.

The only "thing" where I'll slightly differ is that you only reball "once". That's the whole point.
You don't really reball just because you want it to last "longer". It's because of the certainty you get. Also known as peace of mind.
If a properly reballed machine fails again after like 1 year or less as you say... You don't need to reball again because you can be sure that the balls are fine, if you weren't sure before.
That's when you know that your next step is a replacement.
And this is why "reflows" or other half assed methods are not helpful. Not simply because they'll "last less"... But because they are a downward spiral and provide no certainty at all. In the end, the board will be totally unrepairable and you'll still be wondering why.

Because the truth is, when we diagnose the overwhelmingly common "RSX issues"... We aren't just talking about the balls under the chip. Oh no. As you quickly mentioned, the problem can also be on top of the chip.
The real ratio? I wish I knew. But people quickly like to disregard one possibility over the other. Which is never good.

Now as the name suggests, Reballing can only really fix the balls. It's not magic either. But it's still the only "definitive" way to say if the chip is good or bad.
This is why confident BGA rework is needed when dealing with these machines unfortunately. Even in the cases where it doesn't work or last long... It's not for nothing. Just not always the last step.
And fortunately there are further steps.

Lastly about the temperatures... Even 90nm RSXs can run at 54c as you say.
In fact many do, even without delidding or any modifications. And even those can fail. Blaming everything to heat can be another trap.

Cheers
 
The 3000s slims are not that worth it (to me at least ;) cuz they are kinda hard to delidd the IHS so when t breaks, you have a hard time to repair it, asd far as modding it then and ajust the fan speed goes yeah u gonna have a nice decent quiet system ::))


Let me ask you something. I have slim model 2000 and when i press power button console turns on
( green light) fan is not spinning at all (can't even perform fan test), and after 10-15 seconds shuts down. Does that mean the processor has failed or the fan can't cool down chips so it overheats and shuts down? Its puzzling really
 
The only "thing" where I'll slightly differ is that you only reball "once". That's the whole point.
You don't really reball just because you want it to last "longer". It's because of the certainty you get. Also known as peace of mind.
If a properly reballed machine fails again after like 1 year or less as you say... You don't need to reball again because you can be sure that the balls are fine, if you weren't sure before.
That's when you know that your next step is a replacement.
And this is why "reflows" or other half assed methods are not helpful. Not simply because they'll "last less"... But because they are a downward spiral and provide no certainty at all. In the end, the board will be totally unrepairable and you'll still be wondering why.

Because the truth is, when we diagnose the overwhelmingly common "RSX issues"... We aren't just talking about the balls under the chip. Oh no. As you quickly mentioned, the problem can also be on top of the chip.
The real ratio? I wish I knew. But people quickly like to disregard one possibility over the other. Which is never good.

Now as the name suggests, Reballing can only really fix the balls. It's not magic either. But it's still the only "definitive" way to say if the chip is good or bad.
This is why confident BGA rework is needed when dealing with these machines unfortunately. Even in the cases where it doesn't work or last long... It's not for nothing. Just not always the last step.
And fortunately there are further steps.

Lastly about the temperatures... Even 90nm RSXs can run at 54c as you say.
In fact many do, even without delidding or any modifications. And even those can fail. Blaming everything to heat can be another trap.

Cheers
I added a "how many times can I reflow/reball section" to the FAQ.
 
My understanding of electromigration and other actual silicon defects is that rework would rarely have any impact, and it would never make it past real stress testing.
On the atomic scale electrons are moving around the atom so fast that they form a cloud around the nucleus. Imagine if you looked up at night and saw a continuous blur of light because the moon was orbiting around the earth so fast all you saw was a line. Then iagine it didn't just stay in it's orbit, but could move randomly in every direction. It would form a cloud around the earth. Well, electrons are like that. The hotter it is, the faster they move and more energy they have.

On the nano scale atoms are lined up to create a trace. Their electron clouds overlap acording to their bond. The electrons can to pass from cloud to cloud, so long as there is a certain number of them, it doesne't have to be the same one. That what electricity is. We dump a bunch of electron on one end of the trace, and these race along the electron clouds toward the other end. It's just like dumping a bunch of water from a dam. Current is how much water is flowing down the river, voltage is how high the dam is (how much pressure there is forcing the water through).

This is where things get funky! Every microprocessor has "walls" separating traces from other traces. You don't want them shorting, that's bad. Walls are just atoms lines up that don't conduct electricity very well. Copper likes to let electrons flow, FR4 doesn't. Air doesn't. But it will if there is enough voltage, or the traces are really close together. And when you miniturize these structures weird quantum effects start happening. For example, electrons can literally jump from one trace to another through the "wall" that separates them. This is called electromigration. When electrons do this they slowly wear out the wall until it's easy for alot more electrons to flow through the hole. That's an internal short and what you are measuring in the RSX VDDC line when ohm testing it - accumulated electromigration damage! If it's worn a hole to GND, you get a short. The resistance fall setadily as electromigration wears it's way through the "walls" that separate VDDC from GND. Electrons are funky like that.

Obviously it's more complex than that. I didn't mention dendrite formation and electrochemical migration from contamination like flux, water, or dirt and grime.

There are a number of things that accelerate this process. But all of them do the same thing. Anything that gives electrons more energy will accelerate electromigration. The higher the clock frequency the faster electrons are zipping around and the more of them jump to adjacent traces and couple into other signals. And not just the walls, the electrons can destroy the trace itself! The higher the voltage, the faster processors wear out. That's why voltages have been steadily been getting smaller with miniaturization, they need to reduce the voltage to keep separation from +/GND and doing so allows greater transistor density, less heat, better energy efficiency. Both lower voltages and higher clock frequencies have made building decoupling filters more difficult, hence complicated 2 stage RLC filter the tokins are a part of. The smaller we make electronics the bigger this problem gets. But the gains are worth figuring out how to mitigate the issues.

Another thing that gives electrons more energy is heat. Even just the electrons that are part of the traces themselves can get enough energy from reflow temperatures to migrate across the walls that separate VDD/GND. So heat can damage CPU even when no voltage is applied! There's other funky chemistry and quantum effects going on too. Heating anything will cause it's entropy to increase. Chemical reactions occur faster with heat. So even the resins, silicon chemistry, fiberglass/laminating materials decay faster. Lowering the temperature of CPU's decreases these effects significantly, which is why liquid nitrogen allows you to overclock to speeds that will melt down a processor otherwise. removing the heat energy from the system allows the electron you want to follow a conductive path to do more of that, and less of the destructive stuff like blowing holes in the "walls" that separate VDD from GND.

Make sense?
 
nerdy stuff!

I remember hand drawing that junk in VLSI design. Add it to the list of things I've long forgotten anything but general concepts from disuse and booze. I just meant that heat/warping from rework is unlikely to have any restorative impact to non-ball, non-bump chip issues, and they would most certainly re-fail somewhat immediately.

So to way over-simplify you've got:
reball fix, passes stress test -> BGA defect / bump defect
reball fix, fails slow -> bump defect
reball fix, fails fast -> bump defect / die defect
reball no fix -> bump defect / die defect / misdiagnosis

For anyone re-balling, this now matters very much, and is why I am keeping a printout of the note sheets that I was using for data entry in to the spreadsheet taped to each scrap board, since #2 and #3 now sit in piles waiting for modchips (which I STILL keep forgetting to order more of), and #4 sits in another pile by itself for more piddling around when I run out of #2, #3, and miscellaneous high % success work.
 
I realized I never posted the worksheet I use now: https://squeept.com/junk/template.xls

Should fill a normal sheet of paper in landscape. It's pretty basic, but I left plenty of room for random notes. Should probably make a second version for everyone that doesn't have oscilloscopes and rework setups that emphasizes any of the alternative troubleshooting methods, but I don't know what the consensus is there. Feel free to edit your own and share.
 
I realized I never posted the worksheet I use now: https://squeept.com/junk/template.xls

Should fill a normal sheet of paper in landscape. It's pretty basic, but I left plenty of room for random notes. Should probably make a second version for everyone that doesn't have oscilloscopes and rework setups that emphasizes any of the alternative troubleshooting methods, but I don't know what the consensus is there. Feel free to edit your own and share.
My recent study is about slim models where it is dead rsx of 40nm and 65nm in glod state without any errors in syscon . Only fix exchange rsx. On last board I didn't even bother to desolder nothing. Just take measurements and add them for comparison later.
4th case this year on different models up to kte001, this model never seen this before.
http://s.go.ro/sfhd90xz
 
Last edited:
My recent study is about slim models where it is dead rsx of 40nm and 65nm in glod state without any errors in syscon . Only fix exchange rsx. On last board I didn't even bother to desolder nothing. Just take measurements and add them for comparison later.
4th case this year on different models up to kte001, this model never seen this before.
http://s.go.ro/sfhd90xz
I just wanted to post Victors resistance values for anyone following along. It's interesting to see a direct comparison of these. I want to point out that the 2 JSD-001's have different RSX models (CXD5301 on the working console and CXD5300 on the GLOD). It's shouldn't matter? But maybe there are slight differences in the resistances we measure. Also, some of those values will be different on every console. So we're really looking for values that are way off.

Working CECH25xx (JSD-001) and known good RSX (CXD5301)
  • VDDC = 2.7Ω (A Healthy value. New ≈ 3.2Ω)
  • VDDQ = 100.3Ω
  • VDDIO = 96.5Ω
  • YC_RC_VDDIO (FlexIO) = ? (He forgot to measure)
  • VDDA = 61.0Ω
  • VDDR = 449.6Ω
CECH25xx (JSD-001) GLOD Diagnosed with a Dead RSX (CXD5300):
  • VDDC = 1.7Ω (Marginal)
  • VDDQ = 235Ω (High)
  • VDDIO = 95.5Ω (Same)
  • YC_RC_VDDIO (FlexIO) = 12.6Ω (Unknown/No comparison)
  • VDDA = 56.4Ω (Same)
  • VDDR = 315.8Ω (Low)
Discussion:
1.7Ω VDDC is a bit low, but it's within margin. If this was LINK, he'd have one heart left and be panting. But he's not dead yet. There is enough separation on the Core voltage to rule out a short. That's the most common place to burn out.

I would normally be thinking bump failure on the die, if weren't for the GLOD. The RSX must be able to send/receive information to/from the CPU/SYSCON, otherwise it would have triggered a YLOD in POST/BitTraining. So I think that might rule out DIE bump failures.

However, if there were bump failures on the RAM, leading to internal shorts or open lines, then perhaps it's not able to boot because the RSX Die can't communicate properly with it's onboard RAM. Since the issue is on the RSX itself, it's not on the motherboard and doesn't prevent the RSX from responding to SYSCON check-in's, the SYSCON doesn't throw an error code because the RSX is suffering in silence. So the system is stuck in limbo (GLOD) while the RSX fails to get it's $h!T together.

Bump failures seem more likely to me, probably between the DIE and RAM. Perhaps an SMD component on the RSX substrate? Might be worth probing those values with a good chip to see if there's a difference there. If so that would be an easy fix. VDDQ and VDDR are both related to the RAM. And they are the only ones significantly different. These readings rule out short conditions, but not an electromigration open line fault. VDDQ is high, perhaps that's why. It's hard to know what effect an open line fault on some microscopic trace inside the DIE or RAM would have on resistance measured there. Maybe a Bump on the VDDQ line cracked and increased the resistance. That narrative lines up with what we know about aging solder joints - the resistance tends to increase with deformation and oxidation, until an open line forms. However, we also know some resistances, like VDDC tend to decrease. VDDR is the RAM's main voltage, like VDDC is the die's core voltage. Perhaps the lower VDDR is kinda like a health meter for the RAM. Isn't VDDQ used for voltage reference, among other things? It can't be good if it's significantly off.

EDIT:
I just noticed that the difference between the working and GLOD RSX is the same for both VDDR and VDDQ. VDDQ is 135 ohm higher. VDDR is 134 ohms lower. They are both off by about the same amount? That seems sus!

EDIT2:
VDDQ: The supply voltage to the output buffers of a memory chip.
VDDR: Supply voltage to the memory.
 
Last edited:
Let me ask you something. I have slim model 2000 and when i press power button console turns on
( green light) fan is not spinning at all (can't even perform fan test), and after 10-15 seconds shuts down. Does that mean the processor has failed or the fan can't cool down chips so it overheats and shuts down? Its puzzling really

Yo sorry for the late reply
Cant teally tell, maybe fan is broken or the cable or the Plug ^^ but i hope u didnt used it and strill dont cuz yeah, imagine you cant breath anymore (itz just like that ;) and yo try to get a cheap slim ps3 fan at eybay and try him out or open up the ps3 again and see if it really does spin or not or if u take it out and spin it with ya hand if there is some play in there, and if the newer fan also dosent work, well you can throw that baby away :(
 
Yo sorry for the late reply
Cant teally tell, maybe fan is broken or the cable or the Plug ^^ but i hope u didnt used it and strill dont cuz yeah, imagine you cant breath anymore (itz just like that ;) and yo try to get a cheap slim ps3 fan at eybay and try him out or open up the ps3 again and see if it really does spin or not or if u take it out and spin it with ya hand if there is some play in there, and if the newer fan also dosent work, well you can throw that baby away :(
Thanks for reply. I used bluray drive from that slim to fix another one i bought cheap but had faulty laser, so now i don't need to fix this one, i'll use it or sell it for parts hehe. XD
 
PS3 #10
(My first E model! Not metal gear edition tho :/)
This one was Described as "flashing red before shutting down." Sounded like an overheating CPU, but I forgot that it flashes yellow, not red. Upon receiving it is actually a 1-2s YLOD. It doesn't flash red before shutting down, that happens afterwards. Just like a regular YLOD. So that wasn't correctly described by the seller. It could be an easy mistake to make...IDK. Or it might have been intentional. If so, that was a clever deciet. It got me, but then I wasn't paying close enough attention, because I somehow missed the damage on the back ports!

Port Damage 2.jpg Port Damage 1.jpg
The console is sealed, never opened. I saw that sticker and missed the damage in haste! There is sever damage to the LAN/Optical Port. From outward inspection the HDMI port appears to have been spared, but I suspect there are torn traces and SMD damage inside. I haven't broken the seal to inspect it yet. Looking through the vent, I can see the RF shield and some HS fins are bent. So far it's looking like I overpaid ($80). To be fair, the damage was in the photos, I just missed it!

I don't know when I'll actually get around to opening this console. I have a mound of other projects I've been putting off that this PS3 obsession is diverting me from. But I do enjoy the thrill of fixing BC PS3's the most. I don't know why.
I have been wanting to get a COK-002 ever since the PS2 functionality failed to work on PS3#8 (my first frankenstein attempt). I thought there might be an issue with the modchip and decided to buy a COK-002 for testing and because we know for a fact it does work. I was just covering my bases, even though I actually don't have the space for all these consoles anymore and fought the urge to buy another one. A battle I lost...lol! I need to sell a few of them, I just get overwhelmed at the thought of figuring out e-bay, shipping, how not to get screwed, etc. I just gotta bite the bullet and do it. I'm swimming in these consoles and need to recoup some of what I've spent fixing these things.

Anyway, @botakompong has since posted a video proving the modchip does work on COK-001 motherboards. So PS3#8's issue playing PS2 must be related to the console hardware and not the modchip. I'll need to troubleshoot the PS2 hardware and see if I can figure out what's wrong. I guess I didn't need an E model, but oh well. It's nice to have an E-model for testing. Who knows, perhaps it's more stable than A models. Speaking of stability...

PS3#7 - Mini Update
(reballed with tantalum mod)
I did encounter a freeze in Harry Potter and the Sorcerer's Stone. Just after beating the enchanted washing machine, I climbed in the barn window to the left. This is where the first wizard badge (or whatever it's called) is located. Just after getting it there is a cut scene or animation that froze. I could hear it continuing and the sound's of the game running in the background, but there's no way to recover from it without resetting. I repeated it again and it froze in the same place. I even wasted some time running around to see if it happened after a certain period of time or stress. It didn't freeze intil I got that wizard tokin/thing.

I'm hoping that's bug, not an issue with the console itself! Or my tantalum mod! I have yet to attempt to replicate the error on another A model to see if it's universal or just this console. That's the first issue I've run into while playing on PS3#7! I will admit, however, I spend more time fixing consoles and staring at schematics than I do playing games. So I haven't clocked than many hours into any games since "repairing?" it.

That's not to say I don't play games. I just beat Mario odyssey recently. Pretty addictive, but I don't think I'm going to be 100% it anytime soon! That's not to say it isn't good, I really like it. I just don't have that kind of time anymore and I still haven't played breath of the wild yet! The point is that I'm not playing any PS2/PS3 games ATM, so it's kinda hard to thoroughly test these consoles. I wish we had a tool that rand a varied load to thoroughly test the console's full capability (PS1/2/3, bluray, wifi, streaming, music, etc).
 
Last edited:
For further test if I catch one working jsd with his rsx with Samsung ram i will add voltages as well.kte rsx have hynyx ram thats the difference.Even ps4 with Samsung ram fails often then i see more stable rams are Elpida on ps4 ,can be found in phat models and probably ps4 refurbished? Dont know.
Seems Sony is "forgething" to update ram firmware as well?
https://www.guru3d.com/news-story/m...r-geforce-gtx-1070-due-to-memory-issue,2.html
Huh, SONY is using the same micron GDDR5 in some PS4s? Are you saying some of them experience the same graphical artifacting/checkerboarding as seen in some GTX1070's that have micron memory? If not, maybe SONY already updated their firmware and don't need to. I don't work on PS4, so IDK.

Honestly, I don't see the point of PS4 anymore with the PS5 being backwards compatible. That's the kind of thing this industry needs. As hardware ages and dies, there should be a new option to replace it with. Imagine if SONY re-released the PS3? It'd sell out faster than the PS5, especially if it were backwards compatible with PS1/PS2 via software emulation for physical and digital via PSN DLC. The infrasstructure is still there, they just have to merge the PS3 PSN and PS4/5 stores to put all of playstation in one place playable via 2 consoles. OG PS3, new re-release PS3, and PS5. They need to get on that!

Can you imagine a 7nm RSX/Cell? Say byebye to the YLOD!
 
Back
Top