Re: [Gems-users] Trouble with MOESI_CMP_NUCA


Date: Sat, 8 Apr 2006 13:25:51 -0600
From: Steve Barrus <sbarrus@xxxxxxxxxxxx>
Subject: Re: [Gems-users] Trouble with MOESI_CMP_NUCA
I inadvertently modified the checkpoint I was working with, so I am
unable to reproduce that same trace.  Here is a different trace with
the same problem.  I am also including the extra debug info for the
following transition.  Hopefully that will provide you with the
information you are looking for.

 398531   0   2  Collector        Issue_L2_Get  Col_P>Col_P [0x22e9c0, line 0x22e9c0]

Thanks again for your help.

-Steve

On Fri, Apr 07, 2006 at 06:03:38PM -0500, Bradford Beckmann wrote:
> 
> That's right, I did put an assertion there.  Good, so you have
> COLLECTOR_HANDLES_OFF_CHIP_REQUESTS set to true.
> 
> Well on a second, closer look at your debug output, I'm afraid this may be
> a more complicated problem.  The problem is that the L1_GETS and
> PERSISTENT_GETS requests are not being received by all L2 Caches.  This
> leads me to believe that the network is clogged or something really weird
> is happening in the L2 cache mapping functions.  This is really confusing
> because the protocol works fine when I run it.
> 
> I'm going to need more debugging information.  Please do the following:
> 
> - In the action "s_sendSecondPhaseRequest" in the file
> MOESI_CMP_NUCA-col.sm add the following two lines:
> 
>         out_msg.Destination := in_msg.RequestDest;
>         out_msg.RequestsPerRound := out_msg.Destination.count();
>         DEBUG_EXPR(out_msg.Destination);       // <-add line
>         DEBUG_EXPR(out_msg.RequestsPerRound);  // <-add line
>         out_msg.RetryNum := in_msg.RetryNum;
>         out_msg.MessageSize := in_msg.MessageSize;
> 
> - Then in Simics, run the following two commands before the 'c' command:
> 
> ruby0.debug-verb high
> ruby0.debug-filter l
> 
> - Send me the debug output for this transition:
> 
> 314527   0   3  Collector        Issue_L2_Get  Col_P>Col_P
> [0x30a840, line 0x30a840]
> 
> Brad
> 
 398530   0   2  Collector       Miss_Get_last Col_NP>Col_P  [0x22e9c0, line 0x22e9c0] 
../protocols/MOESI_CMP_NUCA-col.sm:165: [0x22e9c0, line 0x22e9c0]
../protocols/MOESI_CMP_NUCA-col.sm:166: [NetDest (4) 0 0 0 0 0 0 0 0  - 0 0 0 0 0 0 0 1
 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 
0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0
 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 
0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 
0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0  - 0 0 0 0 0 0 0 0  - 0 0
 0 0 0 0 0 0  - ]
../protocols/MOESI_CMP_NUCA-col.sm:167: Collector-2
../protocols/MOESI_CMP_NUCA-col.sm:168: Col_P
../protocols/MOESI_CMP_NUCA-col.sm:169: ISSUE_L2_GET
../protocols/MOESI_CMP_NUCA-col.sm:280: [NetDest (4) 0 0 0 0 0 0 0 0  - 0 0 0 0 0 0 0 1
 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 
0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0
 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 
0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 
0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0  - 0 0 0 0 0 0 0 0  - 0 0
 0 0 0 0 0 0  - ]
../protocols/MOESI_CMP_NUCA-col.sm:281: 7
 398531   0   2  Collector        Issue_L2_Get  Col_P>Col_P  [0x22e9c0, line 0x22e9c0] 
 398532   0  -1        Seq               Begin       >       [0x14000, line 0x14000] 
../protocols/MOESI_CMP_NUCA-L1cache.sm:647: [0x14000, line 0x14000]
../protocols/MOESI_CMP_NUCA-L1cache.sm:648: LD
../protocols/MOESI_CMP_NUCA-L1cache.sm:649: L1Cache-0
../protocols/MOESI_CMP_NUCA-L1cache.sm:650: M
../protocols/MOESI_CMP_NUCA-L1cache.sm:651: 0
../protocols/MOESI_CMP_NUCA-L1cache.sm:652: 1
../protocols/MOESI_CMP_NUCA-L1cache.sm:653: 1
../protocols/MOESI_CMP_NUCA-L1cache.sm:1516: [0x14000, line 0x14000]
../protocols/MOESI_CMP_NUCA-L1cache.sm:1517: ]
 398536   0  -1        Seq                Done       >       [0x14020, line 0x14000] 4 cycles L1Cache No
 398500   2  -1        Seq               Begin       >       [0x22e9c0, line 0x22e9c0] 
 398504   0   2    L1Cache                Load     NP>IS     [0x22e9c0, line 0x22e9c0] 
 398516   0 135    L2Cache             L1_GETS  L2_NP>L2_NP  [0x22e9c0, line 0x22e9c0] 
 398518   0   3  Collector            Miss_Get Col_NP>Col_P  [0x22e9c0, line 0x22e9c0] 
 398530   0   2  Collector       Miss_Get_last Col_NP>Col_P  [0x22e9c0, line 0x22e9c0] 
 398531   0   2  Collector        Issue_L2_Get  Col_P>Col_P  [0x22e9c0, line 0x22e9c0] 
 398536   0 231    L2Cache             L1_GETS  L2_NP>L2_NP  [0x22e9c0, line 0x22e9c0] 
 398538   0 247    L2Cache             L1_GETS  L2_NP>L2_NP  [0x22e9c0, line 0x22e9c0] 
 398540   0 151    L2Cache             L1_GETS  L2_NP>L2_NP  [0x22e9c0, line 0x22e9c0] 
 398542   0  23    L2Cache             L1_GETS  L2_NP>L2_NP  [0x22e9c0, line 0x22e9c0] 
 398546   0   2  Collector            Miss_Get  Col_P>Col_P  [0x22e9c0, line 0x22e9c0] 
 398549   0   7    L2Cache             L1_GETS  L2_NP>L2_NP  [0x22e9c0, line 0x22e9c0] 
 398549   0   7    L2Cache             L1_GETS  L2_NP>L2_NP  [0x22e9c0, line 0x22e9c0] 
 398550   0   2  Collector            Miss_Get  Col_P>Col_P  [0x22e9c0, line 0x22e9c0] 
 398550   0   7    L2Cache             L1_GETS  L2_NP>L2_NP  [0x22e9c0, line 0x22e9c0] 
 398554   0   2  Collector            Miss_Get  Col_P>Col_P  [0x22e9c0, line 0x22e9c0] 
 398558   0   2  Collector            Miss_Get  Col_P>Col_P  [0x22e9c0, line 0x22e9c0] 
 398571   0   7  Collector            Miss_Get Col_NP>Col_P  [0x22e9c0, line 0x22e9c0] 
 398571   0   5  Collector            Miss_Get Col_NP>Col_P  [0x22e9c0, line 0x22e9c0] 
 398572   0   6  Collector            Miss_Get Col_NP>Col_P  [0x22e9c0, line 0x22e9c0] 
 398601   0 119    L2Cache             L1_GETS  L2_NP>L2_NP  [0x22e9c0, line 0x22e9c0] 
 398617   0   2  Collector            Miss_Get  Col_P>Col_P  [0x22e9c0, line 0x22e9c0] 
 398678   0   7    L2Cache             L1_GETS  L2_NP>L2_NP  [0x22e9c0, line 0x22e9c0] 
 398700   0   4  Collector            Miss_Get Col_NP>Col_P  [0x22e9c0, line 0x22e9c0] 
 398804   0   2    L1Cache     Request_Timeout     IS>IS     [0x22e9c0, line 0x22e9c0] 
 398808   0   2    L1Cache  Own_Lock_or_Unlock     IS>IS     [0x22e9c0, line 0x22e9c0] 
 398808   0  39    L2Cache     Persistent_GETS L2_I_L>L2_I_L [0x22e9c0, line 0x22e9c0] 
 398814   0 151    L2Cache     Persistent_GETS L2_I_L>L2_I_L [0x22e9c0, line 0x22e9c0] 
 398815   0   1    L1Cache     Persistent_GETS    I_L>I_L    [0x22e9c0, line 0x22e9c0] 
 398816   0   3    L1CachePersistent_GETS_Last_Token      S>S_L    [0x22e9c0, line 0x22e9c0] 
 398816   0 135    L2Cache     Persistent_GETS L2_I_L>L2_I_L [0x22e9c0, line 0x22e9c0] 
 398818   0   3  Collector    Deallocate_Entry  Col_P>Col_NP [0x22e9c0, line 0x22e9c0] 
 398819   0  23    L2Cache     Persistent_GETS L2_I_L>L2_I_L [0x22e9c0, line 0x22e9c0] 
 398822   0   4    L1Cache     Persistent_GETS      S>S_L    [0x22e9c0, line 0x22e9c0] L1Cache-2
 398824   0   7    L1Cache     Persistent_GETS      I>I_L    [0x22e9c0, line 0x22e9c0] L1Cache-2
 398830   0   5    L1Cache     Persistent_GETS      I>I_L    [0x22e9c0, line 0x22e9c0] L1Cache-2
 398830   0   2  Collector    Deallocate_Entry  Col_P>Col_NP [0x22e9c0, line 0x22e9c0] 
 398832   0   6    L1Cache     Persistent_GETS      I>I_L    [0x22e9c0, line 0x22e9c0] L1Cache-2
 398841   0   2    L1Cache                 Ack     IS>IS     [0x22e9c0, line 0x22e9c0] 
 398851   0   0    L1Cache     Persistent_GETS      S>S_L    [0x22e9c0, line 0x22e9c0] L1Cache-2
 398860   0   7  Directory            Lockdown     NO>L      [0x22e9c0, line 0x22e9c0] L1Cache-2
 398870   0   2    L1Cache                 Ack     IS>IS     [0x22e9c0, line 0x22e9c0] 
 398871   0   5  Collector    Deallocate_Entry  Col_P>Col_NP [0x22e9c0, line 0x22e9c0] 
 398871   0   7  Collector    Deallocate_Entry  Col_P>Col_NP [0x22e9c0, line 0x22e9c0] 
 398872   0   6  Collector    Deallocate_Entry  Col_P>Col_NP [0x22e9c0, line 0x22e9c0] 
 398936   0   7    L2Cache             L1_GETS  L2_NP>L2_NP  [0x22e9c0, line 0x22e9c0] 
 398958   0   2  Collector            Miss_Get Col_NP>Col_P  [0x22e9c0, line 0x22e9c0] 
 399000   0   4  Collector    Deallocate_Entry  Col_P>Col_NP [0x22e9c0, line 0x22e9c0] 
 399258   0   2  Collector    Deallocate_Entry  Col_P>Col_NP [0x22e9c0, line 0x22e9c0] 
 410500   7  -1        Seq               Begin       >       [0x22e9c0, line 0x22e9c0] 
 410504   0   7    L1Cache                Load    I_L>IS_L   [0x22e9c0, line 0x22e9c0] 
 410804   0   7    L1Cache     Request_Timeout   IS_L>IS_L   [0x22e9c0, line 0x22e9c0] 
 410808   0   7    L1Cache     Persistent_GETS   IS_L>IS_L   [0x22e9c0, line 0x22e9c0] 
 410811   0 119    L2Cache     Persistent_GETS L2_I_L>L2_I_L [0x22e9c0, line 0x22e9c0] 
 410812   0   7    L2Cache     Persistent_GETS L2_I_L>L2_I_L [0x22e9c0, line 0x22e9c0] 
 410816   0 135    L2Cache     Persistent_GETS L2_I_L>L2_I_L [0x22e9c0, line 0x22e9c0] 
 410816   0   6    L1Cache     Persistent_GETS    I_L>I_L    [0x22e9c0, line 0x22e9c0] 
 410818   0 151    L2Cache     Persistent_GETS L2_I_L>L2_I_L [0x22e9c0, line 0x22e9c0] 
 410819   0   0    L1CachePersistent_GETS_Last_Token    S_L>S_L    [0x22e9c0, line 0x22e9c0] 
 410823   0   5    L1Cache     Persistent_GETS    I_L>I_L    [0x22e9c0, line 0x22e9c0] 
 410824   0   2    L1Cache  Own_Lock_or_Unlock     IS>IS     [0x22e9c0, line 0x22e9c0] 
 410829   0   3    L1CachePersistent_GETS_Last_Token    S_L>S_L    [0x22e9c0, line 0x22e9c0] 
 410831   0   4    L1CachePersistent_GETS_Last_Token    S_L>S_L    [0x22e9c0, line 0x22e9c0] 
 410839   0  23    L2Cache     Persistent_GETS L2_I_L>L2_I_L [0x22e9c0, line 0x22e9c0] 
 410844   0   1    L1Cache     Persistent_GETS    I_L>I_L    [0x22e9c0, line 0x22e9c0] 
 410859   0   7  Directory            Lockdown      L>L      [0x22e9c0, line 0x22e9c0] 
Warning: in fn virtual void Sequencer::wakeup() in system/Sequencer.C:103: Possible Deadlock detected
Warning: in fn virtual void Sequencer::wakeup() in system/Sequencer.C:104: request is [CacheMsg: Address=[0x22e9c0, line 0x22e9c0] Type=LD ProgramCounter=[0x42cbec, line 0x42cbc0] AccessMode=SupervisorMode Size=4 Prefetch=No Version=0 Aborted=0 Time=398500 ]
Warning: in fn virtual void Sequencer::wakeup() in system/Sequencer.C:105: m_chip_ptr->getID() is 0
Warning: in fn virtual void Sequencer::wakeup() in system/Sequencer.C:106: m_version is 2
Warning: in fn virtual void Sequencer::wakeup() in system/Sequencer.C:107: keys.size() is 1
Warning: in fn virtual void Sequencer::wakeup() in system/Sequencer.C:108: current_time is 450001
Warning: in fn virtual void Sequencer::wakeup() in system/Sequencer.C:109: request.getTime() is 398500
Warning: in fn virtual void Sequencer::wakeup() in system/Sequencer.C:110: current_time - request.getTime() is 51501
Warning: in fn virtual void Sequencer::wakeup() in system/Sequencer.C:111: *m_readRequestTable_ptr is [ [0x22e9c0, line 0x22e9c0]=[CacheMsg: Address=[0x22e9c0, line 0x22e9c0] Type=LD ProgramCounter=[0x42cbec, line 0x42cbc0] AccessMode=SupervisorMode Size=4 Prefetch=No Version=0 Aborted=0 Time=398500 ] ]
Fatal Error: in fn virtual void Sequencer::wakeup() in system/Sequencer.C:112: Aborting
***  Simics getting shaky, switching to 'safe' mode.
***  Simics (main thread) received an abort signal, probably an assertion.
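
For anyone comparing the NetDest dump against the trace by hand, here is a
minimal Python sketch that lists which bits are set in each group of a
NetDest dump and which component IDs logged a given event. It is not part of
GEMS. The function names (parse_netdest, receivers) are hypothetical, and the
layout it assumes (bit groups separated by " - ", and trace columns in the
order time, chip, ID, component, event) is inferred only from the output
pasted in this message. The sample data in the usage lines is made up.

```python
import re


def parse_netdest(dump):
    """Return one list of set-bit indices per ' - '-separated group.

    Assumes the dump looks like '[NetDest (n) 0 1 0  - 0 0  - ]', possibly
    wrapped over several lines, as in the debug output above.
    """
    flat = " ".join(dump.split())  # undo mail line wrapping
    match = re.search(r"\[NetDest \(\d+\)([^\]]*)\]", flat)
    if match is None:
        raise ValueError("no NetDest dump found")
    groups = []
    for chunk in match.group(1).split("-"):
        bits = chunk.split()
        if bits:  # the trailing ' - ' leaves an empty chunk; skip it
            groups.append([i for i, b in enumerate(bits) if b == "1"])
    return groups


def receivers(trace, component, event):
    """Return the IDs of `component` instances that logged `event`.

    Assumes whitespace-separated columns: time, chip, ID, component, event.
    Lines where the component and event names are fused are not matched.
    """
    ids = set()
    for line in trace.splitlines():
        fields = line.split()
        if len(fields) >= 5 and fields[3] == component and fields[4] == event:
            ids.add(int(fields[2]))
    return ids


# Made-up sample data, shaped like the output above.
sample_dump = "x.sm:1: [NetDest (3) 0 0 0 0  - 0 1 0\n 1 0 1  - 0 0  - ]"
sample_trace = """
 100   0   1    L2Cache   L1_GETS  L2_NP>L2_NP  [0x40, line 0x40]
 104   0   5    L2Cache   L1_GETS  L2_NP>L2_NP  [0x40, line 0x40]
"""

groups = parse_netdest(sample_dump)
got = receivers(sample_trace, "L2Cache", "L1_GETS")
missing = sorted(set(groups[1]) - got)
print(missing)  # destinations in group 1 that never logged L1_GETS
```

Run against the real dump and trace, the same set difference would show which
destination L2 banks never logged an L1_GETS, assuming the second group of the
dump is the L2Cache group.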