Elliptic Approximations of Propositional Formulae Hans van Maaren
Abstract. A propositional formula can be approximated by a concave quadratic function. This
approximation is obtained as a second order Taylor expansion of a convex smooth model. It is shown that in the 3- SAT case, the involved parameters can be set to such values that yield optimal discriminative properties. Two concentric (generally elliptic) quadratic convex regions are established, the inner one containing only satis able assignments and the outer one excluding the average non satis able assignment and including all satis able assignments. Keywords. Satis ability, convexity, quadratic approximations, quadratic valid cuts. 1991 Mathematics Subject Classi cation. Primary 68Q25,90C09, Secondary 90C10, 90C27.
1 Introduction. Smooth convex and concave transforms of propositional formulae were introduced by van Maaren, Groote and Rozema in [4]. In [5] the eigenvalues of the associated Hessian matrices were used to design a branching variable heuristic which has (experimentally) been shown to result in relatively small search trees, even in case that no other additional node procedure than unit clause elimination was involved. In this paper we investigate the second order Taylor expansion of the smooth concave model on its discriminative properties. It is shown that, in general, a parametric family of valid convex quadratic cuts can be derived and that, speci cally in the 3-SAT case, parameter values can be established in such a way that these valid cuts separate the \average" non satis able truth assignment from all satis able ones. This is done by deriving a threshold value which is the solution to a parametric convex quadratic programming problem. In other words, a convex quadratic region is derived which contains all solutions to the SAT problem involved but which excludes most of the non solutions. Generally this region is shown to be an elliptic region. The above threshold value depends only on the global characteristics of the CNF formula involved, being the numbers Sm , which indicate the number of m-literal clauses. The aim of this research is to provide geometric insight in the satis ability problem and to make a start with using the quadratic valid cuts in order to yield linear valid cuts of speci c interest. As to the rst goal we include a discussion on balanced formula [1] and their geometric representations. As to the second we explicitly derive a formula yielding the desired threshold in the mixed 2,3-SAT case. We pay special attention to pure 2-SAT formulae. This is not because we want to contribute to the solution procedure for this class, as they can be solved in linear time. The reason why we do so is because the expressions involved are much simpler and yet the methods used generalize naturally to the other more complex cases. Although the paper is essentially self contained, the reader is supposed to be familiar with the SAT terminology, the Integer Programming Approach to the SAT-Problem as well as to some basic facts concerning the SAT problem. We refer to [2] for the above. For a detailed discussion of smooth convex models we refer to [4] and [5]. Related studies on smoothing binary programming problems are found in [3] and [6]. 1
We conclude with the remark that in spite of the less attractive expresssions that are created by dierentiating the smooth model, the purely combinatorial entities which show up are partly familiar in the SAT area (when dierentiating once) and partly de ne new (to the best of the author's knowledge) and interesting characteristics of CNF formulae.
2 The smooth concave model. In [4] and [5] smooth convex and concave models are introduced and discussed at length. In order to keep this paper self contained we shall (only very brie y) recall the relevant formulae. Since we only deal with the concave model here we omit superscripts o and * as they appear in the previously quoted papers. For " > 0 we consider A" : (?1; 1] ! [0; 1] given by (2.1)
p
2 A"(x) = x1++ px1 ++""
and for r < 0; Ar : [0; 1]S ! [0; 1] is de ned by (2.2)
Ar (x1; : : :; xS ) = ( S1
S X i=1
xri ) r1
The above A" and Ar are used to smooth the Binary Integer Programming Problem associated to the SAT problem. In fact, for " # 0 and r ! ?1 the concave model below converges to this BIP presentation.
2.1 Clause dependent notations and conventions.
A clause Cs is represented by two index sets Is and Js is such away that
(2.3)
Cs =
_
i2Is
pi _
_
j 2Js
pj
The length of a clause is #Is + #Js and is denoted by `s . Throughout the paper we assume that Is \ Js = ; and that `s 2, that is, no one literal clauses are considered. The concave transform of a clause Cs now is goven by (2.4)
Cs(x) = 1 ? A"s (1 ? (
X
i2Is
xi +
X
j 2Js
(1 ? xj )))
where "s is a parameter depending on the length `s of the clause only. We shall identify freely "s ; "`s or "m in cases where no confusion can appear and where m obviously refers to clause lengths. The value of Cs (x) at the center c = ( 21 ; 21 ; : : :; 12 ) of the cube is (2.5)
Cm = C`s (c) = Cs(c) = 1 ? A"m (1 ? 12 m)
2
2.2 CNF dependent notations and conventions
A CNF formula of N variables is a conjunction of clauses Cs (s 2 S ) where each Is and Js are subsets of f1; : : :; N g. Thus ^ (2.6) = Cs s2S
We let Sm S be the index set refering to the m-literal clauses of . In numerical expressions we shall also use S; S2; S3; : : : to denote the numbers #S; #S2; #S3; : : :. M shall stand for the maximal clause length appearing in . Thus S = S2 [ S3 [ : : : [ SM and S = S2 + S3 + : : : + SM both will have a meaning, being the obvious one. The concave transform of a CNF now is (2.7) (x) = Ar (C1(x); : : :; CS (x)) Notice that in the above one parameter r < 0 and M ? 1 parameters "2 ; "3; : : :; "M > 0 are involved. In [5] it is shown that a threshold value c exists, depending only on the parameters and the global characteristics of (the numbers S2 ; S3; : : :; SM ) such that (2.8) (V ) c if and only if is true at assignment V In other words, the smooth convex region de ned by the inequality (2.9) (x) c separates \true" vertices from \false" vertices (assignments). Again, we mention that for "s # 0 and r ! ?1, (2.9) converges to the BIP representation of the satis ability problem for . In this paper we investigate a weaker, but more accessable inequality, namely the one obtained by replacing (x) by its second order Taylor expansion at c, the center of the cube. Doing so, we shall establish a convex quadratic region in RN which has weaker discriminative properties but is much more adapted for calculations and is suitable for deriving linear valid cuts with speci c features. The value (c) shall frequently occur in our expressions. Notice that X (2.10) (c) = ( 1 ( S C r )) r1
S mM
m m
is a typical global characteristic of . In order to establish the second order Taylor expension of we have to dierentiate (x) twice. This activity is in itself a trivial one, but it generates, as one might expect, less attractive coecients. The combinatorial entities obtained however, are interesting and deserve extra attention. We shall list them here.
2.3 CNF dependent combinatorial entities. (2.11) (2.12) (2.13) (2.14)
POS (m; k) = #fs 2 Smjpk is a literal in Cs g NEG(m; k) = #fs 2 Smj pk is a literal in Csg DIF (m; k) = POS (m; k) ? NEG(m; k) DIF (m) = (DIF (m; 1); DIF (m; 2); : : :; DIF (m; N )) viewed as a column vector. 3
The above entities arise, establishing the rst order partial derivatives of (x), at c. The ones below constitute the ingredients of the second order partial derivatives. (2.15) EOR(m; i; j ) = #fs 2 Sm jpi and pj are oriented equally in Cs g (2.16) UOR(m; i; j ) = #fs 2 Sm jpi and pj are oriented dierently in Cs g (2.17) DIF 2 (m; i; j ) = EOR(m; i; j ) ? UOR(m; i; j ) (2.18) DIF 2 (m) = (DIF 2 (m; i; j )) viewed as a symmetric squared N N matrix: In the above it is understood that for i 6= j the propositional variables pi and pj both have to occur, and that for i = j the occurrence of pi counts as \pi and pj are oriented equally". Thus EOR(m; i; i) is simply the number of occurencies of variable pi in the m-literal clauses, and UOR(m; i; i) is always zero. We close this section by listing the frequently occurring coecients, arising from the dierentiation of (x).
2.4 Occurring coecients when dierentiating (2.19)
(2.20) (2.21) (2.22) (2.23) (2.24) (2.25) (2.26) (2.27)
the parameters r < 0 and "2 ; "3; : : :; "M > 0: m = 1 + p11 + " : m 1 m = 1 + q 1 ?1 2 m : (1 ? 2 m)2 + "m p1 + " ? 1 m m = 3 : ((1 ? 12 m)2 + "m ) 2 r 1 1 Cm = 1 ? m((1 ? 2 m) + (1 ? 2 m)2 + "m ): X SmCmr ) r1 : (c) = ( S1
um =
mM 1 (c)1?r C r?1 : m m m S
q
vm = S1 Cmr?1 mm (1 ? r)(c)1?2r = s = um 1(?c)r :
(2.28) wm = S1 (c)1?r (Cmr?1m + (1 ? r)Cmr?22m 2m): To understand the main goals and reasoning of this paper the reader need not go necessarily in the sometimes tedious details of the calculations, involving unattractive expressions as in the above. It is enough to realize that these are just \real numbers" showing up because of using A" and Ar as given in (2.1) and (2.2). Essential is however, to keep in mind that they are completely determined by the parameters and the global characteristics of the CNF formula involved. 4
3 The quadratic approximation. The second order Taylor expansion, taken at the center of the cube c, of our concave transform (x), can be written as (3.1) (c + ) = (c) + r(c): + 12 (c)( ): where r stands for gradient and for Hessian matrix. As the notorious reader may verify we have P u DIF (m; i) r = ( r ( c )) = i i m1 m (3.2) u = 1 (c)1?r C r?1 m
and
m
S
i;j = ((c))i;j =
(3.3)
vm wm
X m1
vmDIF (m; i)
X m1
vm DIF (m; j )
X ? wmDIF 2(m; i; j ) m1 q 1 = Cmr?1 m m (1 ? r)(c)1?2r S = S1 (c)1?r (Cmr?1m + (1 ? r)Cmr?22m 2m)
The above can be written alternatively as (3.4)
m m
'() = (c + ) ? (c) =
X
s2S
us C`s() + 12 (
X
s2S
vsC`s())2 ? 21
where subscript s in us ; vs and ws should be read as `s , and X X C`s() = i ? j : (3.5) i2Is
X s2S
wsC`s()2
j 2Js
The reader must be alert on the fact that in vertices V of the cube Vi = 12 + i with i = 21 . Thus, in the above, -coordinates referring to truth assignments have value 21 . More compact matrix notations for our entities are P r = m1 um DIF (m) (3.6) 1?r rrt ? P 2 = ( m1 wm DIF (m) c)
Example: For = (p _ q) ^ ( p_ r) ^ (p _ q _ s) ^ ( p _ r_ s) we have 0 1 0 1 0 BB 01 CC BB 1 CC r = u2 B@ ?1 CA + u3 B@ 1 CA 0
0
1 0 1 0 2 1 ?1 2 2 1 1 0 C B B 1 1 0 0 CC = 1(?c)r rrt ? (w2 B B@ 1 0 1 0 CA + w3 BB@ ?11 01 10 ?11 CCA) 2 1 ?1 2 0 0 0 0 5
and
'() = (u2(q ? r ) + u3(q + r )) + 12 (v2(q ? r ) + v3(q + r ))2
? 12 w2(2p2 + q2 + r2 + 2pq + 2pr ) ? 12 w3(2p2 + q2 + r2 + 2s2 + 2pq ? 2pr + 4ps + 2q s ? 2rs )
The aim of this paper is to establish numbers msat and Msat , depending only on the parameters r and "2; : : :; "M and the global characteristics of the CNF formula involved (the numbers S2 ; S3; : : :; SM ) in such a way that fj'() < msatg contains no satis able assignments (3.7) fj'() > Msatg contains only satis able assignments Thus, by knowing msat , the quadratic convex cut (3.8) '() msat shall be a valid cut. Also, speci c values of the parameters shall be given which make these cuts as discriminative possible. Allowing msat and Msat to depend on other characteristics (such as average numbers of occurrencies, number of pure clauses, average length of clauses, : : : ) as well, certainly will increase the discriminative properties of the cuts, however, the analysis will be likewise more involved, and we shall not proceed along these lines. Still, our methods to establish these values leave some space for more speci ty, as the reader may notice in the sequal.
4 The 2-SAT case. In this section we suppose to consist of only 2-literal clauses. This assumption has a considerable simplifying eect on our formulae. The reader may check that in this case (a) (b) (c)
(4.1)
and moreover, (4.2)
(c)1?r = C21?r 2 = 1 u2 = S1 2
s
(d)
v2 = 1 2 1 ? r
(e)
2 w2 = S1 (2 + (1 ? r) C2 )
S
C2
2
!
2 X X C`s() ? 21 w2 C`s()2: '() = u2 C`s() + 12 v22 s2S s2S s2S
X
6
As motivated earlier, we are now interested in approximating the values max '( ) and min '( ) ( c + binary c + binary (c + ) false (c + ) true only in terms of the parameters and global characteristics of (in this case only S since S3 = S4 = : : : = 0 and S2 = S ). Now, by reasons of symmetry, we may restrict ourselves to the values '(c), or in other words, we shall investigate
(4.3)
(4.4)
(
Msat = max '(c) ( (e) false
and
msat = min '(c) ( (e) true
is 2-CNF is 2-CNF Doing so we shall obtain an upperbound (a lower bound respectively) for the values of 4.3. For a 2-literal clause Cs we have (4.5) C`s(c) 2 f1; 0; ?1g depending whether Cs contains two positive literals, two literals of opposite sign, or two negative ones. That is, if we assume that consists of (4.6) we obtain
8 > < S2;2 clauses having 2 positive literals S clauses having 1 positive literal > : S22;;10 clauses having 0 positive literals
(4.7) '(c) = u2(S2;2 ? S2;0) + 21 v22(S2;2 ? S2;0)2 ? 12 w2 (S2;2 + S2;0): The above expression appears to be a convex function of S2;2 and S2;0 and the two values of 4.4 can now be established by solving (4.8)
Msat = max '(c) and 8 > < S2;2; S2;0 0 > SS2;2 + S12;0 S2 : 2;0
msat = min '(c) 8 > < S2;2; S2;0 0 > SS2;2 += S02;0 S2 . : 2;0
Both problems are easily solved. The rst by notifying that a convex function on a simplex attains its maximal value in a vertex. The latter because it is a one dimensional smooth quadratic convex minimization problem. It is understood here that we consider S2;2 and S2;0 to be real variables. We proceed with rewriting the two problems using
x = SS2;2 ; y = SS2;0 ; u2 = Su2; v 2 = Sv2; w2 = Sw2: 7
Notice that u2 ; v2 and w2 only depend on "2 and r. The second problem now reads as msat = 0min (4.9) u x + 12 v22 x2 ? 21 w2x x1 2 which has a solution
8 < S2;2 = 114 S2 S = S > : S22;;10 = 124 S22
that is, the expected value of '( ), at a vertex c + , is (4.14) 'exp = u2 ( 14 ? 41 ) + 12 v22 ( 41 ? 14 )2 ? 21 w2( 14 + 41 ) = ? 14 w2: 8
Thus parameter settings resulting in
? 14 w2 < msat (4.15) are favourable for our purposes. A plausible choice, which also simpli es the formulae involved, is to select "2 and r such that w2 = 2u2. This poses the question whether (see 4.1) 2
(4.16) 2 + (1 ? r) C2 = 22 2 is solvable for " > 0 and r < 0. The reader is invited to con rm that in fact "2 = 1 and 2 p r = 1 ? 2 constitute a solution to 4.16 and that in this case 4.12 simpli es to
8 > < (c + ) true ) (4.17) > : (c + ) false )
1P 1 P 2 1P 2 S s2S C`s ( ) + 2S 2 ( s2S C`s ( )) ? S s2S C`s ( ) 0
1P 1 P 2 1P 2 1 S s2S C`s ( ) + 2S 2 ( s2S C`s ( )) ? S s2S C`s ( ) 2 : When not relaxing for the constraint y S1 in 4.11 a sharper result (valid for S 3) is obtained: the right hand side of the second inequality may be taken as 21 ? S4 + S22 instead of 21 . p We see that putting "2 = 1 and r = 1 ? 2 results in rather attractive formulae, a fact which
invites us to present some examples. P First = (p _ q ) ^ (p_ q ) ^ ( p _ q ) ^ ( p_ q ). Here s2S C`s ( ) = 0 and P consider 2 2 2 s2S C`s ( ) = 4p + 4q . Hence 4.17 reads as
(c + ) true ) p2 + q2 0 (c + ) false ) p2 + q2 ? 21 : The rst implication yields unsolvability since p2 + q2 = 12 at any vertex c + . P More generally, if is perfectly balanced in signs (which exactly means that s2S C`s ( ) = 0) our rst cut reads as (4.18) (c + ) true ) C`s ( ) = 0; for any s 2 S: The above means that such formulae are extremely simple to solve, because the disjunction in any clause should be interpreted as an exclusive or. The next example reveals some of the geometry associated with our quadratic convex cuts. N.B. N does not denote the number of variables in this example. Let be the conjuction of
p_q p_ q p_q p_q .. .
p_q
9 > > = N times > > ; 9
We rst calculate '( ):
!
!
(N ? 2)2 ? 1 2 + N 2 ? 1 2 + '() = (4.19) p q 2(N + 2)2 2(N + 2)2 2N (N ? 2) + 2 ? N + N : ? + N2N + 2 2(N + 2)2 p q N + 2 p N + 2 q Figures 1 and 2 show the regions ( ) = 0 for N = 1; 10; 50 and N = 6; 7; 8 respectively. Analyzing the N = 1 case we see that linear cuts p > ? 21 and q > ? 12 are obtained by minimizing p and q respectively over the region '( ) 0. These cuts precisely correspond to the resolvents p and q , when applying resolution to . Deriving linear cuts perpendicular to the axes of the ellipsoid '( ) 0 we obtain p ? q > ?1 and p ? q < 1 (perpendicular to the short axis) and p + q > 0 (perpendicular to the longer axis, at (0,0)). Now the rst two imply the clauses p_ q and p _ q respectively (clauses which were already present here). The latter in fact implies p = q = 1. Notice that our quadratic cut in fact separates satis able vertices from non satis able ones. When N increases the discriminative properties of the cut gradually vanish, as the pictures show. However, the centre of the ellipsoid (given by the coordinates N 2N+7+2N +2 (1; N )) is always in the right quadrant! p Although the setting "2 = 1; r = 1 ? 2 gives attractive formulae and meets the requirement that the cuts '( ) msat exclude the \average" non satis able assignment, other settings exist which also meet this last requirement. We present another possible choice which gives more freedom when considering mixed 2; 3? CNF formulae and which was in fact used in our experiments of [5]. By posing the condition that we want Msat = 0 the equation (4.20) u2 + 12 v22 ? 21 w2 = 0 is investigated and turns out to simplify to (4.21) 2 = 21 2 which is a requirement not depending on parameter r, and is satis ed precisely if "2 = 41 . This setting meets 4.15 as well and results in the parametrized family of cuts
8 1P P P > < S s2S C`s() + 2S1 2 r ( s2S C`s())2 ? 21S (2 + r) s2S C`s()2 ? 18 r (4.22) > : r = 21 (1 ? r)(p5 ? 1); r < 0: For r ! ?1 the above results in another nice cut: 1 ( X C` ( ))2 ? 1 X C` ( )2 ? 1 : (4.23) s 2S 2 s2S s 2S 8
Notice that in the above the eect of the rst order term has vanished! 10
1 q 0.8 0.6 0.4 0.2 0 p −0.2 −0.4 −0.6 −0.8 −1 −1
−0.8
−0.6
−0.4
−0.2
0
0.2
0.4
0.6
0.8
1
Figure 1: N=1 (solid), N=10 (dashed), N=50 (dotted)
1 q 0.8 0.6 0.4 0.2 0 p −0.2 −0.4 −0.6 −0.8 −1 −1
−0.8
−0.6
−0.4
−0.2
0
0.2
0.4
0.6
0.8
1
Figure 2: N=6 (solid), N=7 (dashed), N=8 (dotted)
11
5 The 3-SAT-case. In this section we shall deal with pure 3-CNF formulae. We shall ask ourselves the same questions as for the 2-SAT case. It will appear that things become slightly more involved and that there is considerably less freedom in selecting appropriate parameters. We start with simplifying our formulae. (c)1?r = u3 = v3 = w3 =
(5.1) and (5.2)
'() = u3
Again we put
X s2S
C31?r 1 S 3 3q 1?r 1 S 3 3 C3 2323 1 S (3 + (1 ? r) C3 )
C`s () + 12 v32(
X
s2S
C`s ())2 ? 12 w3
X s2S
C`s()2:
Msat = max '(c) and msat = min '(c) ( ( (e)false (e)true is 3-CNF is 3-CNF .
(5.3)
Now for a 3-literal clause Cs we have (5.4) We de ne, as in 4.6, (5.5)
C`s() 2 f 32 ; 21 ; ? 21 ; ? 32 g: S3;i = #fs 2 S jCs contains exactly i positive literalsg
obtaining
(5.6)
'(c) = u3( 32 S3;3 + 12 S3;2 ? 21 S3;1 ? 32 S3;0) + 1 23 1 1 3 2 2 v3 ( 2 S3;3 + 2 S3;2 ? 2 S3;1 ? 2 S3;0) ?
1 w ( 9 S + 1 S + 1 S + 9 S ): 2 3 4 3;3 4 3;2 4 3;1 4 3;0 Again, the above expression is a convex function of the S3;i and 5.3 can be solved similarly (but more tediously) as in the 2-CNF case. Following the same lines (but everything with a extra dimension) we obtain: (5.7)
8 1 v2 ? 1 w ? 1 u if 2u3 ? w3 v32 > < 98 32 89 3 23 3 if 2u3 ? w3 ?3v32 msat = > 8 v33 ? 8 w31 + 2 u3 2 : ? 8 w3 ? 8v2 (2u3 ? w3) else 3
and, for S ! 1 12
(5.8)
83 > u + 9 v2 ? 9 w > < 221 u33 + 188 v332 ? 881 w33 Msat = maximum of > ? 1 u + 1 v2 ? 1 w > : ? 23 u3 + 89 v32 ? 98 w3: 2 3
8 3
8 3
Now we have completed the establishment of msat and Msat as functions of r and "3 and hence we obtained a parametrized family of quadratic convex cuts for the (pure) 3-SAT problem. Next we investigate whether it is possible to select r and "3 in such a way that the cut '( ) msat excludes the \average" non satis able assignment. For a random 3-CNF formula the expected values of the S3;i are
8 > S = 1S > < S33;;23 = 388 S33 3 > > : SS3;1 == 18 SS3
(5.9)
3;0
whence (5.10)
8 3
'exp = u3( 18 : 32 + 83 : 12 + 83 : ? 12 + 18 : ? 23 ) + : : : ? 21 w3 ( 94 : 28 + 14 : 68 ) = ? 83 w3:
Investigating the requirement (5.11)
? 83 w3 < msat
excludes the possibility 2u3 ? w3 2= [?3v3; v32], as the reader may verify. The third possible case in msat also violates the above requirement, however, by putting the condition (5.12) 2u3 = w3 it comes as close as possible. We conclude that we cannot nd a parameter setting meeting our strict requirement of 5.11 but that, by putting 2u3 = w3 the \average" non satis able assignment is on the boundary of the cut '( ) msat . We can be slightly more detailed here: In establishing msat in the 2u3 = w3 setting it appears that the minimal value of the associated convex minimization problem is obtained exactly only if 43 S is a natural number. This means that in fact the \average" non satis able assignment may appear on the boundary of '( ) = ? 83 w3 but that the satis able assignments are in the interior of the cut, that is, they meet the requirement '( ) > ? 83 w3 if S is not a multiple of 4. The condition 2u3 = w3 reads as (5.13) (233 ? 3 )C3 = (1 ? r)23 23 which in fact de nes a bounded solution set in r < 0 and restricts "3 to values exceeding approximately 0.55. Putting "3 such that 13
(233 ? 3)C3 (= 1 ? r) > 1
(5.14)
we obtain the quadratic convex cut
23 23
8 1P 1 ( )(P C` ( ))2 ? 1 P C`s ( )2 ? 3 > C` ( ) + s 2 s 2 S s2S s S S s2S 4 2 S < > : " = 2 ? 333 = 2 ? ("3+ 14 )(p"3"3 + 14 ? 12 )
(5.15)
where the inequality can be taken strict if S is not a multiple of 4. The above simpli es when we put 3 = 3 3 = u3 = 12 w3 = v23 , which is satis ed if
(
(5.16)
p
"3 = 34 + 12 3 r = 1 ? C33 = ?1:762551985:
Then 5.14 is met and " = 1 in 5.15. For these parameter values Msat = 38 u3 = 83 3 3. We resume: For "3 and r as in 5.16 we obtain the following convex quadratic cuts: (5.17)
8 P P P > < (c + ) true ) S1 s2S C`s () + 2S12 ( s2S C`s())2 ? S1 s2S C`s()2 ? 34 > : (c + )false ) S1 Ps2S C`s() + 2S1 2 (Ps2S C`s())2 ? S1 Ps2S C`s()2 83 ? S5 + S22 : Yet another possibility is "3 ! 1 in 5.15, resulting in = 2. Corresponding r for this last setting is r = ?2.
The rst example we consider is to arm our conclusions about the impossibility of strictly separating satis able assignments from unsatis able ones using our quadratic convex cuts. Let be the conjunction of
Then
P C` () = 0 and 5.2 becomes
p_q_r p_ q_ r p _ q_ r p_ q _ r:
s
'() = ? 21 w3(4p2 + 4q2 + 4r2) = ? 32 w3 = ? 83 w3
We see that '( ) is constant on the vertices, of which there are satis able and non satis able ones. Notice that our rst cut of 5.17 is p2 + q2 + r2 43 which de nes a sphere with centre 0 containing all vertices of the cube ?c +[0; 1]3 in its boundary. 14
If is the conjunction of all eight clauses ()p _ ()q _ ()r we obtain precisely the same cut. Thus "spheres" may represent satis able formulae as well as non satis able ones. We shall discuss the next example at some length. It demonstrates what can be done with quadratic cuts, using the eigenvalue and eigenspace structure of 4. Let be the conjunction of
p_q_r p _ q_ r p_ q _ r p_ q_ r p_q_r p _ q_ r Here, '( ) ? 43 can be written as
(5.18)
17p2 + 17q2 + 18r2 ? 14pq ? 6p ? 6q 27 2:
Eigenvalues are 10, 24 and 18 with corresponding eigenvectors (1,1,0),(-1,1,0) and (0,0,1). Centre of the ellipsoid is ( 103 ; 103 ; 0). The above cut is separating with respect to true and false assignments. Transforming ?space to -space (where the eigenvectors are taken as a base) through p 1 = 21 p2(p + q ) 2 = 21 2(p ? q ) 3 = r transforms 5.18 into p 10(1 ? 3 2)2 + 2422 + 1832 153 : (5.19) 10 10 1 2 Now 3 = 4 in all vertices, whence 5.19 reduces to 3 p2)2 + 24n2 108 : (5.20) 10(1 ? 10 2 10 108 which means We obtain 22 240 (p ? q )2 216 (5.21) 240 < 1: This implies p = q in a satis able vertex. Now 5.20 reduces to p (5.22) (1 ? 3 2)2 108 100 p 1 p 10 3 which in turn means 1 10 2 ? 10 108, yielding 6 ? 1 p216 > ? 9 > ?1 (5.23) 2p = p + q > 10 10 10 1 which nally leads to p = 2 . 15
6 General and mixed 2,3-SAT case.
In this section we shall only be concerned with the convex quadratic cut '( ) msat. Again we notice that an estimation msat is desired which only depends on S2; S3; : : :; SM and that by reasons of symmetry we may restrict ourselves to solve the problem
msat =(min '(c) (e) true has Si clauses with i literals.
(6.1)
Now C`s (c) = 12 (Is ? Is ) 2 [?ls ; ls]. Let (6.2) then (6.3)
8 > < Sm = Sm;0 + Sm;1 + : : : + Sm;m (j = 2; : : :M ) S is the number of clauses with m literals of which i are positive > : Sm;i m;i 0 '(c) = Pm;i 21 um (2i ? m)Sm;i + 21 (Pm;i 21 vm(2i ? m)Sm;i)2
? 21 (Pm;i 14 wm(2i ? m)2Sm;i)
and, again, the above expression is seen to be convex in the Sm;i and must be minimized under the constraints given in 6.2 and the additional constraints Sm;0 = 0) (m M ). The above problem is a 12 M (M ? 1) dimensional problem and can be actually solved adequately using an accurate convex programming solver if M is not too large and r and "m (m M ) are speci ed! We shall concentrate ourselves to M = 3, thus restricting ourselves to the mixed 2,3-SAT case. Also, we shall x r; "2 and "3 to the values of 4.21 and 5.16. We are then left with a 3-dimensional convex programming problem which we have solved by hand. We shall not present these tedious calculations here and just give the results. Below we use the real numbers (6.4) and the global characteristic (6.5)
2 = 2C2r?1 ( 0:9835:::) 3 = 3C3r?1 ( 0:7349:::)
= (1 ? r) C22 ( 1:7073:::) ! = !(S2; S3) = SS2 C2r + SS3 C3r :
It turned out that msat depends on the value SS32 according to the following three cases:
(6.6)
case 1 : SS2 23 3 2 S 2 3 case 2 : 2 S (1 + ) 3 2
3
case 3 : SS2 (1 + ) 3 3 2 16
2
and has the corresponding values (6.7)
8 3 S 1 ?1 1 2 > ? 4 3 S3 ! r ? 2 (1 ? r)22 3 SS2S2 3 ! 1r ?2 ? 21 (1 ? r)22 SS22 ! r1 ?2 case 1 > > < ? 3 3 S3 ! r1 ?1 + 1 (1 ? r)23 S32 ! 1r ?2 ? 1 (1 + )(1 ? r)23 S2S2 3 ! 1r ?2 case 2 2 S S msat = > 34 SS3 1 ?1 18 2 1 1 ?2 S S S ? 2 1 2 3 2 3 ? 3 ! r ? ( + 2)(1 ? r)3 S2 ! r ? 4 (1 + )(1 ? r)23 S2 ! r > > : ? 14 (1 ?S r)22 S222 !81r ?2 case 3: 8 S
The reader is invited to con rm that for S3 = 0 and S2 = 0, msat takes the values derived for the separated cases respectively. Also in this case one can show (as in 4.14 and 5.10) that the "average" non satis able vertex c + does not satisfy '() > msat. In fact, it does not satisfy '() msat as soon as S2 6= 0. However, it is questionable whether this is a useful observation, since mixed 2,3-SAT formulae are typically appearing when solving a 3-SAT problem and as such generally are not random.
7 Geometric evidence for the hardness of balanced formulae. Numerical experiments (Dubois [1]) have shown that random 3-SAT formulae in the hard region tend to become even harder in so called balanced cases. We believe that our quadratic convex cuts explain this feature quite well. First, we notice that for random formulae , the eigenvalues of (c) = (ij ) are generally nonzero and consequently negative. This means, that the inequality '( ) msat de nes an ellipsoid, the center C of which is given by the solution of (7.1)
(C ) = ?r
Now a formula which is balanced in sign de nes an approximately zero gradient r. Therefore, the linear term in '( ) vanishes and consequently C = 0. Thus the geometrical picture of a sign balanced formula is an ellipsoid with centre 0. In this case, rst order heuristics based on the Taylor expansion of (x) at the center yields no information and, in case of pure 3-SAT formulae, '( ) msat simpli es to (we use the parameter setting of 5.16 and 5.17): X C`s()2 43 S (7.2) s2S In the above case, however, the length of the axes may dier. Next, the reader is invited to con rm that for pure 3-SAT formulae the diagonal terms of are given by POS (3; i) ? NEG(3; i) POS (3 ; i ) + NEG (3 ; i ) 2 i;i = u3 ( (7.3) ) ? 2( ) S S
Noticing that in the above expression the quadratic term is much smaller than the linear term, we see that formulae which are balanced in occurrences of the variables the diagonal terms of are approximately equal. Eigenvalues of , therefore, are approximately equal too. Hence the geometrical picture of an occurrence balanced formula is a sphere. However, in this case, its centre need not be zero necessarily and hence rst order heuristics may yield pro t. 17
Now a formula which is balanced for both features has as its geometrical picture a sphere with centre zero. The inequality of 7.2 now has all coecients of the i2 approximately equal. Still, the o diagonal terms may cause some slight dierence in the length of the axes! Consider for instance the double balanced formula = (p _ q _ r) ^ (p_ q _ r) ^ ( p _ q _ r) ^ ( p_ q _ r) Here, '( ) msat simpli es to 4p2 + 4q2 + 4r2 + 8q r 3 which de nes an elliptic cylinder (there is a zero eigenvalue here, with eigenvector along q = r ). The eigenvector of the largest eigenvalue (notice that the inequality sign has reversed) is along q = ?r . In fact, the above inequality yields q r 0 immediately, implying that q $ r is a necessary condition for satis ability. The above means that, from a geometric point of view, double balanced formulae do not represent typically the hardest possible cases. However, if the o diagonal terms tend to vanish too, that is, if is moreover pairwise balanced in sign, meaning DIF 2 (3; i; j) 0 for all i 6= j (7.4) the geometrical picture is an almost perfect sphere with centre 0 and 7.2 simpli es to the non informative inequality 3S 1 3S X 2 4 i ?nr. of occurrences (7.5) 43S = 4 N iN N of the variables We encourage the experimentalists to test the hardness of the above type of random 3-SAT formulae.
References [1 ] O. Dubois. Double balanced formulae. 1996. DIMACS SAT workshop. Preliminary Release Note. [2 ] J. Gu, P.W. Purdom, J. Franco and B.W. Wah. Algorithms for the Satis ability (SAT) Problem: A Survey. 1996. DIMACS SAT workshop. Preliminary Release Note. [3 ] L.Lovasz and A. Schrijver. 1991. Cones of matrices and set-functions and 0-1 optimization. SIAM J. Optimization. Vol. 1, No 2, 166-190. [4 ] H. van Maaren, J.F. Groote and M. Rozema. 1995. Veri cation of propositional formulae by means of convex and concave transforms. Techn. rep. TU Delft, nr. 95-74. [5 ] H. van Maaren. 1996. On the use of second order derivatives for the Satis ability Problem. Techn. rep. TU Delft, nr. 96-34. [6 ] H.D. Sherali and W.P. Adams. 1990. A hierarchy of relaxations between the continuous and convex hull representations for zero-one programming problems. SIAM J. Disc. Math. Vol. 3, No. 3, 411-430. 18