2011年12月25日星期日
MICRO'11: Parallel Application Memory Scheduling
2011年12月24日星期六
FMCAD'11: Effective Word-Level Interpolation for Software Verification
FMCAD'11: IC3: Where Monolithic and Incremental Meet
2011年12月23日星期五
ASPLOS'10: Conservation Cores: Reducing the Energy of Mature Computations
HPCA'10: Aérgia: Exploiting Packet Latency Slack in On-Chip Networks
2011年12月22日星期四
HPCA'03 : Runahead Execution: An Alternative to Very Large Instruction Windows for Out-of-order Processors
2011年12月14日星期三
OSDI'10: FlexSC: Flexible System Call Scheduling with Exception-Less System Calls
2011年12月2日星期五
MICRO'11: Q S C ORES: Trading Dark Silicon for Scalable Energy Efficiency with Quasi-Specific Cores
MICRO'11: Manager-Client Pairing: A Framework for Implementing Coherence Hierarchies
MICRO'11:PACMan: Prefetch-Aware Cache Management for High Performance Caching
2011年12月1日星期四
MICRO'11:Idempotent Processor Architecture
2011年11月30日星期三
MICRO'11: Architectural Support for Secure Virtualization under a Vulnerable Hypervisor
MICRO'11: Towards the Ideal On-chip Fabric for 1-to-Many and Many-to-1 Communication
2011年11月27日星期日
CAV'11: Linear Completeness Thresholds for Bounded Model Checking
2011年11月25日星期五
CAV'11: Interactive Synthesis of Code Snippets
CAV'11: A Specialization Calculus for Pruning Disjunctive Predicates to Support Verification
2011年11月24日星期四
CAV'10: Smoothing a Program Soundly and Robustly
PLDI'10: Smooth interpretation
CAV'11: Quantitative Synthesis for Concurrent Programs
2011年11月23日星期三
CAV'11: Resolution Proofs and Skolem Functions in QBF Evaluation and Applications
2011年11月22日星期二
MICRO'10: Register Cache System not for Latency Reduction Purpose
MICRO'10: Combating Aging with the Colt Duty Cycle Equalizer
2011年11月21日星期一
MICRO'10:Single-Chip Heterogeneous Computing: Does the Future Include Custom Logic, FPGAs, and GPGPUs?
MICRO'10:Voltage Smoothing: Characterizing and Mitigating Voltage Noise in Production Processors via Software-Guided Thread Scheduling
2011年11月19日星期六
MICRO'10 : Probabilistic Distance-based Arbitration: Providing Equality of Service for Many-core CMPs
MICRO'10:Fractal Coherence: Scalably Verifiable Cache Coherence
2011年11月18日星期五
MICRO'10: Virtual Snooping: Filtering Snoops in Virtualized Multi-cores
MICRO'10: Adaptive Flow Control for Robust Performance and Energy
2011年11月17日星期四
MICRO'10: ScalableBulk: Scalable Cache Coherence for Atomic Blocks in a Lazy Environment
2011年11月4日星期五
MICRO10 : Pseudo-Circuit: Accelerating Communication for On-Chip Interconnection Networks
MICRO10: Erasing Core Boundaries for Robust and Configurable Performance
MICRO10: Parichute: Generalized Turbocode-Based Error Correction for Near-Threshold Caches
MICRO10:Minimal Multi-Threading: Finding and Removing Redundant Instructions in Multi-Threaded Processors
2011年10月26日星期三
Rippling: a heuristic for guiding inductive proofs
So rippling just convert each inferring rules of this reasoning system into a new rewriting rule, which transforms formulas with the same skeleton.
Such transformation is very similar to the rippling wave on the water surface, which is the source of the name rippling.
2011年10月23日星期日
SOSP11: PTask: Operating System Abstractions To Manage GPUs as Compute Devices
These tasks can be schedule on to GPU as well as CPUs. In this way, non-neccesarry copy between devices, drivers and user codes are avoid, and the user does not need to care about the data movement explictly.
SOSP11:Scalable Consistency in Scatter
2011年9月27日星期二
MICRO10: Flexible and Efficient Instruction-Grained Run-Time Monitoring Using On-Chip Reconfigurable Fabric
MICRO10: Memory Latency Reduction via Thread Throttling
This paper propose a gather computie scatter programing style that seperate the program into computing and memory thread, and schedule both of them to make sure that the number of memory thread would not over run the limited memory bandwidth.
MICRO10: The ZCache: Decoupling Ways and Associativity
2011年8月28日星期日
POPL02:Lazy Abstraction
2011年8月19日星期五
CAV11:HAMPI: A String Solver for Testing, Analysis and Vulnerability Detection
CAV11:Interpolation-Based Software Verification with Wolverine
2011年8月8日星期一
TACAS11:Sound and Complete Monitoring of Sequential Consistency for Relaxed Memory Models
2011年8月7日星期日
FMCAD11:Effective Word-Level Interpolation for Software Verification
FMCAD07:Lifting Propositional Interpolants to the Word-Level
VMCAI11:SAT-Based Model Checking Without Unrolling
This approach contains two nested loop,
1. in the major loop, it iteratively constructs a list of formulas F0....Fk, in which Fk means the over-approximation of the states set reachable in k steps.
2. in the minor loop, it iteratively refines the F0...Fk, such that they form an inductive path that leads to the property P to be verified.
DAC11:Re-Synthesis for Cost-Efficient Circuit-Level Timing Speculation
This paper reduce the number of paths that need "Timing Speculation" by retiming to improve timing character.
And then it use ILP to share pading buffers between different paths.
MICRO03:Razor: A Low-Power Pipeline Based on Circuit-Level Timing Speculation
So this paper propose to use a shadow register to sample the critical path a little bit later than the real register, and then compare them. If the result is not the same, then a circuit is used to roll back the correct state.
One major overhead of "Timing Speculation" is the need to pad the short paths, to prevent them from making the comparator to think that a too long path has cause a timing error.
DAC11:Abstraction-Based Performance Analysis of NoCs
This traffic pattern include the burst transition number, transition rate and bound.
And then it generate a regulator for each channel that represent this pattern and insert them into the correspoding channel, and constraint that they all hold in verification.
In this way, the traffic pattern that we interest in will be forced to hold.
This approach can also be extended to multiple channels, for example, to constrain that we dont allow five source to flood the sink at the same time, be we allow one of them to send.
Inferring Assertion for Complementary Synthesis" is accepted by ICCAD'1
My paper "Inferring Assertion for Complementary Synthesis" has just been accepted by ICCAD'11.
TACAS11: Canonized Rewriting and Ground AC Completion Modulo Shostak Theories
Very interesting paper that combine AC(ssociativity and commutativity) operators with Shostak approach.
Generally, there are two approaches for theory combinations, nelson-oppen and Shostak.The former one just pass equalities between T-solvers, so it is relativly easy to combine AC with it.
But for Shostak, it need to computer canonical form, which is not possible for AC.
TACAS11: Transition Invariants and Transition Predicate Abstraction for Program Termination
Normal invaraints and abstraction are for states, while this paper apply them to transitions.
But I think the disjunctive well foundness is a very well known result.
I dont understand why this paper published on TACAS 11 again, these topic are well discussed before 2006.
CAV11: Verification of Certifying Computations
This checker is actually an over-approximation of the behavior of the original program.
CAV11: Existential Quantification as Incremental SAT
But I think the major innovation is to use sorting network to find out the shortest prime clause. This is bettern than my approach in VMCAI05: Minimizing Counterexample with Unit Core Extraction and Incremental SAT, which reduce the size of a block clause by iteratively remove those literal not in the support set of a proof.
CAV11: Temporal property verification as a program analysis task
It is a very interesting paper. Given a transition system and a temporal logc property, generate a program that search for the proof or counterexample of this property. and then use existing program analysis tools to determine that whether this program terminate with some answer.
All the methods that boost the speed, such as abstraction refinement, will be automatically performed by the program analysis tools.
2011年3月30日星期三
TACAS'11::Optimal Base Encodings for Pseudo-Boolean Constraints
Of course there are some direct solving methods, such as that of PBS solver proposed by Fadi Aloul .
But it seems that translating into SAT is also very interesting. it refer to some early papers:
1. A Translation of Pseudo-Boolean Constraints to SAT:
This paper just iterate though all coefficient, and case split on they to consider the two cases of them, and substract the correspond value from the right handside of (dis)equation.
In this way, an SAT instance is encoded.
2. Translating Pseudo-Boolean Constraints into SAT
it propose three methods
a. BDD based: similar to above method
b. adder network: it express the coefficient with a set of 2-exp, and add them together with adders.but because there are XOR gates in these adders, it is Not arc-consistent.
c. sorting network : for small coefficients n, it use n length bit vector to express it.Assume again that the sum of all coe is N, then a sorter of size N is instanced.For variables x and its coe n, x is connected to n inputs of the sorter. For right hand side x, the 0-th to x-th output of sorter is asserted to be 1.
To deal with larger coe, a set of base must be founds, such that each sorter can be used to express larger number that 1.
To minimize the size of generated SAT instance, a set of optimal base must be found, this is the motivation of this TACAS11 paper.
This TACAS11 paper propose three cost function to evaluate the total cost that need to be minimized:
a. summing the digits used to express coes: only bases with only prime numbers are need to be considered.
b. also taking into acount of the carry bits between sorters: non-prime based must be consider
c. consider the number of comparators in sorters.
With these cost function, it proposes three searching methods, they approximate the cost function and perform branch and bound searching to find a minimized cost base.