Location>code7788 >text

What happened to the GC end marker SuspendEE?

Popularity:245 ℃/2024-08-21 11:17:25

I: Background

1. Storytelling

The writing of this piece originated when a friend at the boot camp mentioned a problem in the!t -special The output has aSuspendEE character, how is this character made in coreclr? The output is as follows:


0:000> !t -special
ThreadCount:      3
UnstartedThread:  0
BackgroundThread: 2
PendingThread:    0
DeadThread:       0
Hosted Runtime:   no
                                                                                                            Lock  
 DBG   ID     OSID ThreadOBJ           State GC Mode     GC Alloc Context                  Domain           Count Apt Exception
   0    1     4ab0 000001CC44E5C490    2a020 Cooperative 0000000000000000:0000000000000000 000001cc44e520d0 -00001 MTA (GC) 
  11    2     19d8 000001CC44E84700    21220 Preemptive  0000000000000000:0000000000000000 000001cc44e520d0 -00001 Ukn (Finalizer) 
  12    3     6668 000001CC44ED4520    2b220 Preemptive  0000000000000000:0000000000000000 000001cc44e520d0 -00001 MTA 

          OSID Special thread type
        0 4ab0 SuspendEE 
       10 3b6c DbgHelper 
       11 19d8 Finalizer 

Haha, in fact, I can especially understand, a lot of people learn advanced debugging after curiosity will be bursting, look at what all want to explore the bottom, there is a kind of technical rebirth, this article we will have a good chat.

II: WinDbg Analysis

1. What is the SuspendEE marking?

The word is called Suspend Engine Execution.Freeze execution engine So, where is the entry method to freeze the execution engine? This tests your knowledge of the skeleton diagram of GC operations, and there is one in the coreclr source code, simplified as follows:


     GarbageCollectGeneration()
     {
         SuspendEE();
         garbage_collect();
         RestartEE();
     }
     
     garbage_collect()
     {
         generation_to_condemn();
         gc1();
     }

upperSuspendEE() This is the entry function for the SuspendEE tag in SOS, so let's dig deeper into this method.

2. What did SuspendEE do?

If you read through the source code for the SuspendEE() method, you'll see that the core enumeration variable is theThreadType_DynamicSuspendEE, it plays a role in setting the scene, the reference code is as follows:


thread_local size_t t_ThreadType;

void ThreadSuspend::SuspendEE(SUSPEND_REASON reason)
{
    // set tls flags for compat with SOS
    ClrFlsSetThreadType(ThreadType_DynamicSuspendEE);
}

void ClrFlsSetThreadType(TlsThreadTypeFlag flag)
{
    t_ThreadType |= flag;

    gCurrentThreadInfo.m_EETlsData = (void**)&t_ThreadType - TlsIdx_ThreadType;
}

enum PredefinedTlsSlots
{
    TlsIdx_ThreadType = 11 // bit flags to indicate special thread's type
};

enum TlsThreadTypeFlag // flag used for thread type in Tls data
{
    ThreadType_DynamicSuspendEE = 0x00000020,
}

From the above code we can see that t_ThreadType is a C++ level thread local storage, which means that every thread has its backup, and it is also the core source of the SuspendEE flag, if the 11th slot of m_EETlsData is 0x20, the SuspendEE flag is successfully hit, and the source can be tracked by the gCurrentThreadInfo.m_EETlsData variable to track down the source. With this much information, the next step is to verify the code.

III: Case validation

1. A test code

The code is very simple, a simple manual GC trigger.


    internal class Program
    {
        static void Main(string[] args)
        {
            ();

            ();

            ();
        }
    }

Next, use windbg to place a breakpoint on the SuspendEE method at the entry point.bp coreclr!ThreadSuspend::SuspendEE Observe, screenshot below:

once theThreadType_DynamicSuspendEE=0x20 After assigning the value, you can then use windbg to do a validation.


0:000> x coreclr!*gCurrentThreadInfo*
000001a1`668ee8c0 coreclr!gCurrentThreadInfo = struct ThreadLocalInfo

0:000> dx -id 0,0 -r1 (*((coreclr!ThreadLocalInfo *)0x1a1668ee8c0))
(*((coreclr!ThreadLocalInfo *)0x1a1668ee8c0))                 [Type: ThreadLocalInfo]
    [+0x000] m_pThread        : 0x1a166902e50 [Type: Thread *]
    [+0x008] m_pAppDomain     : 0x1a166948b40 [Type: AppDomain *]
    [+0x010] m_EETlsData      : 0x1a1668ee880 [Type: void * *]

0:000> dp 0x1a1668ee880
000001a1`668ee880  00000000`00000000 00000000`00000000
000001a1`668ee890  00000000`00000000 00000000`00000000
000001a1`668ee8a0  00000000`00000000 00000000`00000000
000001a1`668ee8b0  00000000`00000000 00000000`00000000
000001a1`668ee8c0  000001a1`66902e50 000001a1`66948b40
000001a1`668ee8d0  000001a1`668ee880 00000000`00000020

As you can see from the output above000001a1668ee8d0+0x8 The content of the address has been successfully seeded, and I believe that by this time the!t -special Can get the markers too.


0:000> !t -special
                                                                                                            Lock  
 DBG   ID     OSID ThreadOBJ           State GC Mode     GC Alloc Context                  Domain           Count Apt Exception
   0    1     640c 000001A166902E50    2a020 Preemptive  000001A16B0094A8:000001A16B00A5B8 000001a166948b40 -00001 MTA (GC) 
  11    2     3e50 000001A16692B2D0    21220 Preemptive  0000000000000000:0000000000000000 000001a166948b40 -00001 Ukn (Finalizer) 
  12    3     6a24 000001A16699F8F0    2b220 Preemptive  0000000000000000:0000000000000000 000001a166948b40 -00001 MTA 

    OSID Special thread type
        0 640c SuspendEE 
       10 76b0 DbgHelper 
       11 3e50 Finalizer 

So when is this 0x20 taken out? The answer to this can be found in the source code, continue to go run, the output is as follows:


void ClrFlsClearThreadType(TlsThreadTypeFlag flag)
{
    t_ThreadType &= ~flag;
}

0:012> dp 0x1a1668ee880
000001a1`668ee880  00000000`00000000 00000000`00000000
000001a1`668ee890  00000000`00000000 00000000`00000000
000001a1`668ee8a0  00000000`00000000 00000000`00000000
000001a1`668ee8b0  00000000`00000000 00000000`00000000
000001a1`668ee8c0  000001a1`66902e50 000001a1`66948b40
000001a1`668ee8d0  000001a1`668ee880 00000000`00000000

Of course if you go looking for the source code implementation of sos, you will find the appropriate answer as well.


HRESULT PrintSpecialThreads()
{
    ...
    if (ThreadType & ThreadType_DynamicSuspendEE)
    {
        type += "SuspendEE ";
    }
    ...
    return Status;
}

IV: Summary

Digging into the past life of this tag is actually quite interesting looking back, coreclr actually added the m_EETlsData field to give sos a compromise, haha, this highlights the status of sos as a first class citizen.
图片名称