Re: [Gems-users] Understand the protocol trace


Date: Mon, 25 Sep 2006 12:06:54 -0500
From: "Lei Yang" <lya755@xxxxxxxxxxxxxxxxxxxx>
Subject: Re: [Gems-users] Understand the protocol trace
I just found in the documentation that PT_TO_PT and FILE_SPECIFIED are the recommended network topologies for the CMP protocols. Will give that a try.

Lei
----- Original Message ----- From: "Mike Marty" <mikem@xxxxxxxxxxx> To: "Lei Yang" <lya755@xxxxxxxxxxxxxxxxxxxx>; "Gems Users" <gems-users@xxxxxxxxxxx>
Sent: Monday, September 25, 2006 11:59 AM
Subject: Re: [Gems-users] Understand the protocol trace


1)  Turn off RANDOMIZATION in $GEMS/ruby/config/testerconfig.defaults.
This randomly adds 100+ cycle delays to generate race conditions

2)  You are probably using a non-CMP topology and NETWORK_LINK_LATENCY is
fairly high.

--Mike


Dear list,

I was experimenting with MSI_MOSI_CMP_directory protocol with the tester. With the little.trace on GEMS online documentation http://www.cs.wisc.edu/gems/doc/wiki/moin.cgi/How_do_I_understand_a_Protocol , below is the protocol trace I got:

Testing clear stats...Done.
Reading trace from file 'little.trace'...
1 7 -1 Seq Begin > [0x400, line 0x400] 4 1 3 L1Cache Load NP>L1_IS [0x400, line 0x400] 141 1 0 L2Cache L1_GETS L2_NP>L2_IS [0x400, line 0x400] 390 0 0 Directory GETS NP>S [0x400, line 0x400] 635 1 0 L2Cache Data_ext_ack_0 L2_IS>L2_SS [0x400, line 0x400] 1097 7 -1 Seq Done > [0x400, line 0x400] 1096 cycles NULL Yes 1097 1 3 L1Cache L1_Data L1_IS>L1_S [0x400, line 0x400] 1101 1 -1 Seq Begin > [0x400, line 0x400] 1104 0 1 L1Cache Load NP>L1_IS [0x400, line 0x400] 1139 0 0 L2Cache L1_GETS L2_NP>L2_IS [0x400, line 0x400] 1176 0 0 Directory GETS S>S [0x400, line 0x400] 1309 0 0 L2Cache Data_ext_ack_0 L2_IS>L2_SS [0x400, line 0x400] 1445 1 -1 Seq Done > [0x400, line 0x400] 344 cycles NULL Yes 1445 0 1 L1Cache L1_Data L1_IS>L1_S [0x400, line 0x400]

According to the documentation, the first column indicates the cycle. I don't understand why the operation cycles are so large. In my configuration,

MEMORY_RESPONSE_LATENCY_MINUS_2: 78
DIRECTORY_LATENCY: 80
L2_RESPONSE_LATENCY: 6
L1_RESPONSE_LATENCY: 3
L1_REQUEST_LATENCY: 2
L2_REQUEST_LATENCY: 4
NETWORK_LINK_LATENCY: 40

I don't understand why the cache operations would add up to, 1096 cycles for the first LD as an example. Could someone explain this please? By the way, in the ruby config file, there is a TIMER_LATENCY: 10000. I wonder what this is.

Thanks a lot! I appreciate your comments.

Lei




[← Prev in Thread] Current Thread [Next in Thread→]