Proceedings of the 46th IEEE Conference on Decision and Control New Orleans, LA, USA, Dec. 12-14, 2007
FrC14.1
Stability and Controllability of Planar, Conewise Linear Systems Ari Arapostathis and Mireille E. Broucke
Abstract— This paper presents a fairly complete treatment of stability and controllability of piecewise-linear systems defined on a conic partition of R2 . This includes necessary and sufficient conditions for stability and controllability, as well as establishing that controllability implies stabilizability by piecewise-linear state feedback. A key tool in the approach is the study of the Poincar´e map.
I. I NTRODUCTION This paper studies stability and controllability of piecewise-linear systems defined on a conic partition of R2 , which we call conewise linear systems (CLS). We derive necessary and sufficient conditions for stability and for controllability, as well as establish that controllability implies stabilizability via piecewise-linear state feedback. The analysis relies on the study of the Poincar´e map. As long as the standard assumptions are posed concerning the lack of trajectories following unstable eigenvectors or unstable sliding modes, the properties of the Poincar´e map are the determining factor in stability. The Poincar´e map is again used to study controllability, thus providing a unifying theme. Assuming there are no one-dimensional controlled invariant subspaces or half-lines (those on sliding surfaces), a Poincar´e type map of the boundary of the funnel of the controlled trajectories provides necessary and sufficient conditions for controllability. Pachter and Jacobson [1] also obtain a necessary and sufficient condition for stability of switched linear systems in the plane with conic switching by calculating the gain of a Poincar´e map. In this paper we go one step further by obtaining explicit algebraic expressions for what we refer to as the characteristic values of the CLS. Roughly speaking, for a CLS there are two mechanisms that lead to stability or instability. One is the effect of the time-average of the eigenvalues of the individual linear components on each partition weighted by the fraction of the time that trajectories spend on each partition. The other is induced by the non-commutativity of the individual linear maps. The expressions obtained in this paper distinguish between the two components and thus shed some new light on the issue of stability. The first author is supported in part by the Office of Naval Research through the Electric Ship Research and Development Consortium and in part by the National Science Foundation under Grant ECS-0424169. The second author is supported by the Natural Sciences and Engineering Research Council of Canada. Ari Arapostathis is with the Department of Electrical and Computer Engineering, University of Texas, Austin TX 78712, U.S.A.
[email protected] Mireille Broucke is with the Department of Electrical and Computer Engineering, University of Toronto, Toronto, ON M5S 3G4, Canada
[email protected].
1-4244-1498-9/07/$25.00 ©2007 IEEE.
II. P RELIMINARIES In this section, we present some preliminary definitions and results. In particular, we show that if a closed, convex cone contains no subspaces and no eigenvectors of the system matrix, then all trajectories escape the cone. Definition 1: Let x˙ = Ax be the dynamics on a convex cone K of Rd . We define an eigenvector of A to be visible ¯ the closure of K. if it lies in K, The following result appeared in [2] and relies on Lefschetz’s fixed point theorem. Lemma 1 (Pachter, [2]): Let K be a non-empty closed convex cone in Rd but not a linear subspace. If K is invariant under the semigroup {eAt }, i.e., eAt K ⊂ K for all t ≥ 0, then K contains an eigenvector of A. Lemma 1 clearly implies the following result. Its relevance is in enabling us to argue that the characteristic values computed in Section III are well-defined. Theorem 1: Let K be a closed convex cone in Rd , and suppose K does not contain a subspace of Rd . Suppose no eigenvectors of A ∈ Rd×d lie in K. Then for any initial condition x0 ∈ K, x0 6= 0, there exists t0 ∈ R such that / K. eAt0 x0 ∈ Proof: Suppose that for some non-zero initial condition ˆ denote the maximal x0 ∈ K, eAt x0 ∈ K, for all t ≥ 0. Let K invariant set under the semigroup {eAt } contained in K; that ˆ is formed by the union of trajectories that lie in K for is, K ˆ= all t ≥ 0. Clearly K 6 ∅, and since the dynamics are linear, ˆ is also a closed convex cone. Moreover, it is evident that K ˆ is not a subspace since K does not contain a subspace K ˆ contains an eigenvector of A, of Rd . Thus, by Lemma 1, K leading to a contradiction. III. S TABILITY In this section we define the characteristic values of a planar CLS and express them as explicit functions of the system parameters. The method amounts to computing the growth of trajectories over one cycle around the origin and using this parameter to obtain the asymptotic behavior of the CLS. Let A = {Aj ∈ R2×2 , j = 1, . . . , k0 } be a collection of matrices and let {v1 , . . . , vℓ+1 } be a set of unit vectors in R2 directed counterclockwise such that vℓ+1 = v1 . We define Θ(· , ·) to be the angle in radians between two vectors in R2 in the counterclockwise sense, and assume, without loss of generality, that Θ vi , vi+1 < π. Let {K1 , . . . , Kℓ } be a set of open convex cones that form a partition of R2 such that Ki is generated by {vi , vi+1 }. On each Ki we have the dynamics x˙ = Ai x with Ai ∈ A. We denote the resulting CLS by Σ = {(Σi , Ki ) , i = 1, .. . , ℓ} where Σi denotes the dynamics on Ki . Let J = 10 −10 and define the index set I = {1, . . . , ℓ}.
6268
46th IEEE CDC, New Orleans, USA, Dec. 12-14, 2007 For i ∈ I, we define Vi = {λvi : λ ∈ (0, ∞)}. Let ni denote the unit vector orthogonal to Vi satisfying nT i vi+1 > 0 (i.e., {n1 , . . . , nℓ } is a collection of unit normal vectors to {V1 , . . . , Vℓ } ordered counterclockwise). The asymptotic behavior of the system Σ is determined by the visible eigenvectors, sliding modes, and by the trajectories which encircle the origin. First we place conditions on the visible eigenvectors and sliding modes to insure stability. Let △
T α+ i = ni Ai vi ,
△
T α− i = ni+1 Ai vi+1 ,
i∈I.
FrC14.1 Θ(v ′ ,v ′
0 λi
holds with
τi =
(III.1)
λ′i + λ′′i 2 v′ v′ 1 i2 . βi = log i1 ′′ v ′′ 2 vi1 i2
△
Clearly, all trajectories that lie on Vi are asymptotically − stable if and only if ξi < 0. In the case α+ i = αi−1 = 0, vi is an eigenvector of both Ai and Ai−1 , and as a result all trajectories that lie on Vi are asymptotically stable if and only if the corresponding eigenvalues are both negative. We summarize this in the following lemma. Lemma 2: In order for Σ to be asymptotically stable it is necessary that (i) All visible eigenvectors are associated with stable eigenspaces. − + − 6 0, then ξi < 0, i.e., (ii) If α+ i αi−1 ≤ 0 and |αi | + |αi−1 | = all sliding modes are stable. Next we compute the time needed for a trajectory to transverse a cone, as well as its growth in the cone. These calculations are used later to determine the asymptotic behavior of the trajectories that encircle the origin. Fix i ∈ I. Suppose that Ai has no visible eigenvectors relative to Ki . Without loss of generality we may assume that α+ i > 0. ¯ i is invariant Then necessarily α− > 0, for otherwise K i under the semigroup {eAi t }, and by Lemma 1 must contain an eigenvector of Ai contradicting the hypothesis. Thus the trajectory of Σi starting at vi exits the cone crossing the set Vi+1 in finite time by Theorem 1. We consider three cases depending on the Jordan form of Ai . Case 1: Ai ∈ R2×2 has a pair of complex eigenvalues λi ± jωi . Let Pi ∈ R2×2 denote the transformation such that Ai = Pi λi I + ωi J Pi−1 . The time τi that it takes the system z˙ = (λi I + ωi J)z to traverse the cone {Pi−1 vi , Pi−1 vi+1 } Θ(Pi−1 vi , P −1 vi+1 ) is τi = . This is the same as the time that ωi it takes the original system x˙ = Ai x, with x(0) = vi , to traverse Ki . We define: vi′ = Pi−1 vi ,
vi′′ = Pi−1 vi+1 ,
and αi = λi ,
βi = log
A simple computation yields x(τi ) = eµi τi vi+1 ,
kv ′ k i
.
(III.3)
βi , τi
(III.4)
kvi′′ k
µi = αi +
(III.2)
v ′ v ′′ 1 i2 i1 log ′ v ′′ λ′i − λ′′i vi1 i2
αi =
− + − If α+ 6 0, let ri ∈ [0, 1] denote i αi−1 ≤ 0 and |αi | + |αi−1 | = − the (unique) number satisfying ri α+ i + (1 − ri )αi−1 = 0. Let
ξi = viT (ri Ai + (1 − ri )Ai−1 )vi .
)
i i+1 . where τi = ωi Case 2: Ai ∈ R2×2 has two distinct real eigenvalues λ′i > λ′′i . 2×2 Let denote the transformation such that A = ′Pi ∈ R λi 0 −1 Pi and define vi′ , vi′′ by (III.2). Then (III.4) Pi ′′
(III.5)
Note that since Ki contains no eigenvectors of Ai it has to be the case that vi′ and vi′′ have the same sign (component ′ ′′ wise). Also vi1 vi2 6= 0, otherwise Ki contains an eigenvector of Ai . Therefore formulas (III.5) are well-defined. Case 3: Ai ∈ R2×2 has a real eigenvalue λi of multiplicity 2 in its minimal polynomial. Let ∈ R2×2 denote the transformation such that Ai = λ P1i i Pi Pi−1 and define vi′ , vi′′ by (III.2). Then (III.4) 0 λi holds with ′′ ′′ ′ ′ vi1 vi1 1 vi1 vi1 τi = ′′ − ′ = ′ ′′ det ′′ ′ vi2 vi2 vi2 vi2 vi2 vi2 αi = λi βi = log
(III.6) v′ i2
′′ vi2
.
′ ′′ Note that vi2 vi2 6= 0, otherwise Ki contains an eigenvector of Ai . The following may be proved by direct computation so we omit the proof. Lemma 3: The expressions for αi and βi are independent of the choice of the Pi ’s. We now present the main result of this section. Let X τi . τ= i∈I
Theorem 2: The planar CLS Σ = {(Σi , Ki ) , i = 1, . . . , ℓ} is asymptotically stable if and only if (a) Conditions (i) and (ii) of Lemma 2 hold. (b) If there are no visible eigenvectors or sliding modes, then with τi , αi , and βi as defined in (III.3), (III.5), (III.6), X (τi αi + βi ) µ := < 0. (III.7) τ i=I Proof: First show that if there are no visible eigenvectors or sliding modes, then (III.7) is necessary and sufficient. Without loss of generality suppose that α+ 1 > 0. Then, as mentioned in the paragraph following Lemma 2 we must + + have α− 1 > 0, and hence also α2 > 0. By induction αi > 0, − αi > 0 for all i ∈ I. Thus the trajectory x of Σ satisfying
6269
46th IEEE CDC, New Orleans, USA, Dec. 12-14, 2007 x(0) = v1 , encircles the origin and crosses V1 at time τ . Using the results in Cases 1–3 above, we have
kx(τ )k = Pℓ eBℓ τℓ Pℓ−1 · · · P1 eB1 τ1 P1−1 v1
= eµ1 τ1 Pℓ eBℓ τℓ Pℓ−1 · · · P2 eB2 τ2 P2−1 v2 (III.8) .. . = eµτ kvℓ+1 k = eµτ , △
where Bi = Pi−1 Ai Pi , i ∈ I. Therefore, kx(kτ )k = ekµτ , for all k ∈ N, implying that (III.7) is necessary. It is also sufficient since if x ˆ ∈ R2 \ {0}, then ̺ˆ x ∈ {x(t) : 0 ≤ t < τ }, for some ̺ > 0. Thus, the trajectory starting from x ˆ converges asymptotically to 0, provided (III.7) holds. Necessity of (a) is asserted in Lemma 2. It remains to show that if Σ has visible eigenvectors or sliding modes, then (a) is sufficient. It is evident that in this case, a trajectory cannot revisit a cone it exits. Therefore it has to get trapped in some cone Ki after some time t0 . Then necessarily either Ai has a visible eigenvector relative to Ki , or there is a stable sliding ¯ i . In both cases it is fairly straightforward to show mode in K that the trajectory converges asymptotically to the origin. It is also evident that trajectories are bounded uniformly over any bounded set of initial conditions. This completes the proof. Remark 1: 1) When there are no visible eigenvectors or sliding modes the stability of Σ is determined by the complex numbers µ ± jω, where 2π . (III.9) ω= τ Thus, we call them the characteristic values of the CLS. P P 2) Let β = i∈I βi and α = i∈I τi αi . If β = 0, then stability results if α < 0; that is, the time-average of the eigenvalues is negative. Likewise, if λi = 0 for all i ∈ I, then stability depends only on β, which is independent of the eigenvalues of the individual matrices Ai . A further examination of the constituent terms in µ and their relation to the work in [3], [4] can be found in [5]. IV. C ONTROLLABILITY Consider a controlled CLS Σ whose dynamics are specified by x(t) ˙ = Ai x(t) + bi u(t) on Ki , where Ai ∈ R2×2 , 2 bi ∈ R and u(t) ∈ R. As before, Σi denotes the restriction of Σ on Ki . For i ∈ I, define Bi = span{bi }. We present a rather complete characterization of controllability of this △ system on R2∗ = R2 \ {0}, the punctured plane which does not include the origin. The punctured plane is also used in the analysis of controllability of bilinear systems [6]. We can also develop a controllability theory for the full plane but this requires a more complicated analysis of trajectories that can cross through 0, and studying the well-posedness of trajectories that pass through a vertex of a partition. Let U be a set of controls. if x′ , x′′ are two points in R2∗ , we say that x′ can be steered to x′′ over U, and denote this
FrC14.1 by x′ x′′ , if there exists a u ∈ U and T > 0, such that the controlled system admits a unique solution in R2∗ satisfying x(0) = x′ and x(T ) = x′′ . Solutions are meant in the sense of Filippov. If D ⊂ R2∗ , then x′ D means that x′ x′′ ′′ for all x ∈ D. We say that Σ is completely controllable on R2∗ if x′ R2∗ for all x′ ∈ R2∗ . Also, we say that Σi is completely controllable if any two points in Ki can be joined through a trajectory in Ki . Difficulties with existence and uniqueness of solutions for discontinuous systems are well known [7]–[9], and several solution concepts have been proposed to overcome them [10], [11]. Here we highlight by way of an example the difference between open-loop and closed-loop controls with respect to uniqueness of solutions of discontinuous systems. Consider the one-dimensional system ( u if x < 0 x˙ = (IV.10) −u if x > 0 . Suppose x(0) = −1. Clearly there is no continuous feedback control u which can steer the system to 1. On the other hand under the feedback u = 1, if x < 0 and u = −1, if x > 0, the closed loop system has a unique trajectory x(t) = t − 1, and hence −1 is steered to 1 on [0, 2]. Along this trajectory the control u takes the values u(t) = 1, for t ∈ [0, 1], and u(t) = −1, for t ∈ [1, 2]. However, using this u as an open-loop control in (IV.10) we observe that there is loss of uniqueness of the solution, and as a result −1 cannot be steered to 1 by this open loop control. Apropos the above discussion, we consider two classes of control inputs: a) the set of all bounded measurable feedback controls u : R2∗ → R, which is denoted by Um , and b) the set of all piecewise-continuous open-loop controls u : [0, ∞) → R, denoted by U. In Section IV-A we study controllability over U of the subsystems Σi , i ∈ I. We also establish that the reachable sets of Σi over U are also reachable over the class of constant gain linear feedback controls. In Section IV-B we study the reachability from cone to cone, over Um . In Section IVC we combine these results to obtain necessary and sufficient conditions for controllability of Σ over Um . A slight strengthening of these conditions renders them necessary and sufficient for controllability of Σ over U; results can be found in [5]. In what follows we work with a refinement of the original partition which enables a simplification of the results on reachability within cones. If Bi ∩ Ki 6= ∅, we divide Ki into two cones along Bi . Similarly, if A−1 i Bi is one dimensional and A−1 B ∩ K = 6 ∅, we divide K i i i into two cones along i A−1 B . We retain the same notation for the CLS on the i i refinement of this partition. A. Reachability within Cones Controllability of Σ depends heavily on the reachable sets of Σi . Thus, in this section, the reachable sets of Σi are analyzed. Let ϕi (t, x0 ; u) denote the trajectory x(t), t ≥ 0, of x˙ = Ai x + bi u, satisfying x(0) = x0 . If bi 6= 0, let b∗i denote the
6270
46th IEEE CDC, New Orleans, USA, Dec. 12-14, 2007 unit vector which is orthogonal to bi and satisfies xT b∗i > 0 △ ¯ for all x ∈ Ki . If bi = 0, set b∗i = 0. Also, let Ki∗ = K i \{0}. For x ∈ Ki∗ define
FrC14.1 open line segment joining x ˜′′ and x′ + λ0 bi . Let ζ ∈ [0, ∞), ′′ ′ z˜ = x ˜ − x , and consider the feedback control u(t) =
△˘
ReachΣi (x) = ϕi (t, x; u) : t ≥ 0 , u ∈ U , ¯ ϕi (s, x; u) ∈ Ki∗ , ∀s ∈ [0, t] .
△
2
W (x, bi ) = {z ∈ R :
(b∗i )T z
>
(b∗i )T x}
(IV.12)
The closed-loop system resulting from (IV.12) is
To assist in the taxonomy of ReachΣi , define, for bi 6= 0, +
z˜T JAi x(t) − ζλ0 bT i Jx(t) . J z ˜ bT i
x(t) ˙ =
∪ {x}
bT i JAi x(t) z˜ − ζbT i Jx(t)λ0 bi . ˜ bT i Jz
(IV.13)
△
W − (x, bi ) = {z ∈ R2 : (b∗i )T z < (b∗i )T x} ∪ {x} . Lemma 4: Assume that if Ki∗ ∩ Bi 6= ∅ then Ai bi ∈ / Bi . For x ∈ Ki∗ the following hold: (A) If bi = 0, then ReachΣi (x) = eAi t x : t ≥ ′ 0 , and eAi t x ∈ Ki∗ , ∀t′ ∈ [0, t] . (B) If bi 6= 0 and range(Ai ) ⊂ Bi , then ReachΣi (x) = (x + Bi ) ∩ Ki∗ . (C) If bi 6= 0, and range(Ai ) 6⊂ Bi , then ReachΣi (x) = ( + T T ∗ W (x, bi ) ∩ Ki∗ ,
if (vi+1 + vi ) Ai bi > 0
W (x, bi ) ∩ Ki∗ , if (vi+1 + vi )T ATi b∗i < 0 . −
Proof: Cases (A) and (B) are obvious. For case (C) first note that since Ai x ∈ / Bi for all x ∈ Ki , we have ∗ xT AT i bi 6= 0. Suppose, without loss of generality, that ∗ xT AT i bi > 0, for all x ∈ Ki . It follows that if ϕi (s, x; u) ∈ ReachΣi (x), where x ∈ Ki∗ , s ∈ [0, t], and t > 0, then (b∗i )T ϕ˙ i (s, x; u) ≥ 0, for almost all s ∈ [0, t]. Suppose ϕi (t, x; u) 6= x. We claim that (b∗i )T ϕi (t, x; u) > (b∗i )T x. If not, then (b∗i )T ϕ˙ i (s, x; u) = 0 for almost all s ∈ (0, t), from which it follows that (b∗i )T Ai ϕi (s, x; u) = 0, for all s ∈ [0, t], or equivalently that Ai ϕi (s, x; u) ∈ Bi . This implies ϕi (s, x; u) 6∈ Ki , so either ϕi (s, x; u) ∈ Vi or ϕi (s, x; u) ∈ Vi+1 , for all s ∈ [0, t]. Suppose, without loss of generality, △ the latter is the case. Then, z = ϕi (t, x; u) − x ∈ Vi+1 is a nonzero vector in Ki∗ which satisfies z ∈ Bi (since by assumption (b∗i )T z = 0) and Ai z ∈ Bi . This contradicts the hypothesis of the lemma. Hence, ReachΣi (x) ⊂ W + (x, bi )∩ Ki∗ . To show the converse, let x′′ ∈ W + (x′ , bi )∩Ki∗ , x′′ 6= x′ , and set z = x′′ − x′ . Suppose, without loss of generality that b∗i = Jbi . If Ai x′ ∈ / Bi and Ai x′′ ∈ / Bi then if we let T −1 T u(t) = (bi Jz) z JAi x(t), we obtain x(t) ˙ = Ai x(t) + bi u(t) = bT JA x
bT i JAi x(t) z. bT i Jz
(IV.11)
Since ibT Jzi > 0 for all x = ξz + x′ , ξ ∈ [0, 1], it follows i by (IV.11) that the solution x(t) with x(0) = x′ , satisfies x(t′′ ) = x′′ for some finite t′′ > 0. Suppose that Ai x′′ ∈ Bi and Ai x′ ∈ / Bi . Since, by construction of the partition, A−1 B ∩ K = ∅, it must be the case that x′′ ∈ Vi ∪ Vi+1 . i i i Without loss of generality suppose x′′ ∈ Vi . If follows from the hypothesis that Vi 6⊂ Bi and thus the line x′ + λbi , λ ∈ R intersects Vi , i.e., x′ + λ0 bi ∈ Vi , for some λ0 ∈ R. Since Ai x′ ∈ / Bi , implying x′ ∈ / Vi , it follows that λ0 6= 0. We know that x′ + λ0 bi 6= x′′ , since x′′ ∈ W + (x′ , bi ). Let x ˜′′ ∈ Vi ∩ W + (x′ , bi ) be any point such that x′′ lies in the
It follows from the foregoing that if ζ = 0 then the trajectory x(t) of (IV.13) starting at x(0) = x′ converges asymptotically to x ˜′′ , along the straight line joining these two ∗ T points. Also, since bT i Jx = −(bi ) x < 0, for all x ∈ Ki , T ′ and bi Jx 6= 0, the vector field bT i Jx(t)λ0 bi results in a trajectory that joins x′ and x′ + λ0 bi along a straight line in finite time. For y ∈ Ki let η1 (y) = Vi ∩{y+̺˜ z | ̺ ∈ R} , η2 (y) = Vi ∩{y+̺bi | ̺ ∈ R} , and define Γy = conv{y, η1 (y), η2 (y)}, where ‘conv’ denotes the convex hull. Let γy (t, ζ), with t ≥ 0, denote the trajectory of (IV.13), starting from y, i.e., γy (0, ζ) = y, and set △ τ (y, ζ) = inf {t ≥ 0 : γy (t, ζ) ∈ V1 } . It is evident from the direction of the vector field of (IV.13) that provided ζ > 0, then τ (y, ζ) < ∞ and (IV.14) γy (t, ζ) : t ∈ 0, τ (y, ζ) ⊂ Γyo ,
with Γyo denoting the interior of Γy . In particular, for ζ > 0, x′′ , x′ + γx′ (τ (x′ , ζ), ζ) lies in the relative interior of conv{˜ λ0 bi }. Since the vector field of (IV.13) is transversal to Vi , τ (x′ , ζ) is continuous in ζ ∈ (0, ∞), and in turn, the same holds for γx′ (τ (x′ , ζ), ζ). Continuity of the solution of (IV.13) with respect to ζ, combined with (IV.14), shows that ( x ˜′′ as ζ → 0 γx′ τ (x′ , ζ), ζ → x′ + λ0 bi as ζ → ∞ . Therefore, γx′ τ (x′ , ζ ′′ ), ζ ′′ = x′′ , for some ζ ′′ ∈ (0, ∞). If Ai x′′ ∈ / Bi and Ai x′ ∈ Bi , the conclusion follows along the same lines, by using time reversal. If Ai x′′ ∈ Bi and Ai x′ ∈ Bi , using an intermediate point x ˆ ∈ Ki ′ T T ′′ satisfying Ai x ˆ ∈ Bi and bT Jx < b J x ˆ < b Jx , the i i i ′ previous arguments show that x ˆ ∈ ReachΣi (x ) and x′′ ∈ x). ReachΣi (ˆ The proof of Lemma 4 shows that linear feedback control can be used to steer in ReachΣi as stated in the following corollary. Corollary 3: Assume that if Ki∗ ∩ Bi 6= ∅ then Ai bi ∈ / Bi . Also, suppose bi 6= 0 and range(Ai ) 6⊂ Bi . Let x′ ∈ Ki∗ and x′′ ∈ ReachΣi (x′ ) such that span{Ai x′ , Ai x′′ } 6⊂ Bi . Then there is a feedback control u = kiT x, for some ki ∈ R2 , such that the trajectory x(t), with x(0) = x′ , satisfies x(t′′ ) = x′′ , for some t′′ > 0 and x(t) ∈ Ki for all t ∈ (0, t′′ ).
6271
46th IEEE CDC, New Orleans, USA, Dec. 12-14, 2007 B. Reachability between Cones In this section we analyze the existence of controlled trajectories (over Um ) starting in Ki and reaching Ki+1 , and vice versa. The main idea is to analyze the possible directions of flow of Σi and Σi+1 along Vi+1 . We use the notation Ki Ki+1 to indicate that there exists a controlled trajectory x(·) in Ki ∪ Ki+1 , defined for t ∈ [−ε, ε], with ε > 0 and satisfying x(−ε) ∈ Ki , x(ε) ∈ Ki+1 . Analogously for Ki+1 Ki . In order to indicate the direction (counterclockwise, or clockwise) that the boundary Vi can be crossed by controlled trajectories, we define the set Gi ⊆ {1, −1} with the property that 1 ∈ Gi if Ki Ki+1 , and −1 ∈ Gi if Ki+1 Ki . Let △ βi+ =
nT i bi
,
△ βi− =
nT i+1 bi
.
Then using (III.1) and the signum function, and allowing for discontinuous controls, we have ˘ − − − + ′ + Gi = sgn(α− i + uβi ) : (αi + uβi )(αi+1 + u βi+1 ) > 0 , ¯ ∃u, u′ ∈ R . (IV.15)
A more explicit characterization of Gi is provided by the following lemma. Lemma 5: For each i ∈ I, + (i) If βi+1 βi− 6= 0, then Gi = {1, −1}. + (ii) If βi+1 βi− = 0, then + − {sgn(α− if βi+1 = βi− = 0 , α+ i )} i+1 αi > 0 + {sgn(α− if βi+1 6= 0 i )} Gi = − {sgn(α+ i+1 )} if βi 6= 0 {∅} otherwise. C. Main Result
In this section we gather the previous results on reachability within and between cones to obtain our main result on controllability. The essential idea is to analyze trajectories which encircle the origin either in a counterclockwise or clockwise sense. We compute the maximum and minimum growth around such a cycle. Necessary and sufficient conditions for controllability are obtained in terms of these growth factors—both shrinkage and expansion must be possible. The existence of trajectories that encircle the origin is a necessary condition for controllability of Σ; for if not, either some Vi is invariant under any controlled trajectory or there is a subcollection of cones whose T union is invariant under △ any controlled trajectory. Let G = i∈I Gi . We require the following. Condition 1: G = 6 ∅. Note that under Condition 1 the hypothesis of Lemma 4 is + satisfied for all i ∈ I. For if not, then either α+ i = βi = 0, − − or αi = βi = 0, resulting in Gi = ∅. It is necessary to determine the growth around a cycle, as in Theorem 2. We define the inverse of the maximum possible growth in Kj as ξ j (κ) and the minimum possible
FrC14.1 growth in Kj as ξ j (κ). These growth factors can be computed explicitly using Lemma 4. Definition 2: Assume Condition 1. Define for j ∈ I and κ∈G ∗ 0 if (vj+1 + vj )T AT j bj > 0 −κ vjT b∗ j ∗ ξ j (κ) = if (vj+1 + vj )T AT T j bj ≤ 0 , bj 6= 0 vj+1 b∗ j e−κµj τj if bj = 0 , T ∗ κ vj bj T b∗ vj+1 j ξ j (κ) = 0 κµj τj e
∗ if (vj+1 + vj )T AT j bj ≥ 0 , bj 6= 0 ∗ if (vj+1 + vj )T AT j bj < 0
if bj = 0 .
Here µj and τj are the trajectory growth rate and time to transverse Kj computed in Section III. Theorem 4: For Σ to be completely controllable on R2∗ , over Um , it is necessary and sufficient that (a) Condition 1 holds. (b) For some κ ∈ G the following inequalities hold △
ξ(κ) =
ℓ Y
△
ξ j (κ) < 1 ,
ξ(κ) =
j=1
ℓ Y
ξ j (κ) < 1 .
j=1
(IV.16) Proof: Necessity of (a) has been discussed earlier. Note that if ∗ (vj+1 + vj )T AT (IV.17) j bj = 0 , and bj 6= 0, then necessarily range(Aj ) ⊂ Bj . Thus if (IV.17) holds for all j ∈ I, the reachable set from every point x is one-dimensional. It follows that if Σ is completely controllable, then ξ(κ)ξ(κ) = 0. To show that (b) is necessary, first observe that if G = {1, −1}, then ξ(κ) = ξ −1 (−κ), provided ξ(κ) 6= 0, otherwise ξ(κ) = ξ(−κ) = 0. Similarly for ξ(κ). It follows from these arguments that if (b) does not hold, then we may Q suppose without loss of generality that G = {1}, and ℓj=1 ξ j (κ) ≥ 1. Consider the collection of points zi ∈ Vi defined by z1 = v1 and µτ if bj = 0 e j j z j zj+1 = vT b∗ j j+1 z otherwise. v T b∗ j j
j
2∗
Let γ : [0, ℓ] → R , be the curve defined by γ(s) = µ (s−j+1)τ j zj if bj = 0 , s ∈ [j − 1, j] e j `
zj+1 + (s − j) zj+1 −
T vj+1 b∗ j
vjT b∗ j
zj
´
if bj 6= 0 , s ∈ [j − 1, j].
According to the hypothesis kzℓ+1 k ≤ kv1 k. Consider the Jordan curve consisting of {γ(s) , s ∈ [0, ℓ]} and the straight segment [zℓ+1 , z1 ] ⊂ V1 and let D denote its interior. It ¯ thus arriving follows by Lemma 4, that ReachΣ (v1 ) ⊂ D, at a contradiction. Sufficiency: Assume (a)–(b). Without loss of generality suppose 1 ∈ G, and ξ(1) < 1, ξ(1) < 1. By Lemma 4,
6272
46th IEEE CDC, New Orleans, USA, Dec. 12-14, 2007 if bj 6= 0 and range(Aj ) 6⊂ Bj , then ReachΣj (vj ) ∩ Vj+1 contains all points of the form ̺j vj+1 , where T ∗ vj bj ∗ , ∞ , if (vj+1 + vj )T AT T j bj > 0 vj+1 b∗ j ̺j ∈ T ∗ ∗ 0, Tvj bj ∗ , if (vj+1 + vj )T AT j bj < 0 . v b j+1 j
Otherwise, ReachΣj (vj ) ∩ Vj+1 = {̺j vj+1 }, where T ∗ v b vTj jb∗ , if bj 6= 0 , and range(Aj ) ⊂ Bj j+1 j ̺j = eµj τj , if bj = 0 .
Then, by considering the trajectories that follow a complete cycle, we have if ξ(1) = 0 ̺v1 : ̺ ∈ (ξ(1), ∞) ReachΣ (v1 )∩V1 ⊃ ̺v1 : ̺ ∈ (0, ξ −1 (1)) otherwise. (IV.18) Iterating (IV.18) we obtain ReachΣ (v1 ) ∩ V1 ⊃ V1 , and the result now easily follows. Remark 2: If ξ(κ)ξ(κ) = 0 then (IV.16) implies ξ(κ) + ξ(κ) < 1. On the other hand, if ξ(κ)ξ(κ) 6= 0 then ξ(κ) = −1 ξ (κ) and (IV.16) does not hold. It follows that (IV.16) in Theorem 4 may be replaced by ξ(κ) + ξ(κ) < 1. Example 1: In this example none of the individual pairs (Ai , bi ) are controllable, yet the CLS is completely controllable. Let Ki , i = 1, 2, 3, 4, correspond to the four quadrants of the plane in counterclockwise order. We define 1 3 0 A1 = 0 3 , b1 = −1 2 A2 = 0 , b2 = 1 0 1 b3 = b4 = 0 . A3 = A4 = −1 0 ,
An easy calculation yields G = {−1}, ξ(−1) = 0, and ξ(−1) = 0.5. Example 2: In this example all of the individual pairs (Ai , bi ) are controllable and conditions (a)–(b) of Theorem 4 are satisfied, yet the CLS is not completely controllable. As in Example 1, let Ki , i = 1, 2, 3, 4, correspond to the four quadrants of the plane in counterclockwise order. We define −1 1 2 3 A1 = A3 = −1 −1 , A2 = A4 = −3 2 , −1 1 −1 1 b1 = −1 , b2 = 1 , b3 = , b4 = −1 . 1
Here, G = {1, −1}, ξ(1) = ξ(−1) = 0, and ξ(1) = ξ(−1) = 1. V. S TABILIZATION
Theorem 5: Suppose Σ is completely controllable over Um . Then it is stabilizable by piecewise-linear feedback of the form u = kiT x, for x ∈ Ki , where ki ∈ R2 , i ∈ I. Proof: Without loss of generality suppose 1 ∈ G, and ξ(1) < 1. Let i ∈ I be arbitrary. By Lemma 4, if bi 6= 0
FrC14.1 and range(Ai ) 6⊂ Bi , then points of the form ̺i vi+1 , h T ∗ vi bi , ∞ , vi+1 T b∗ i ̺i ∈ i 0, viT b∗i , T ∗ v b i+1 i
ReachΣi (vi ) ∩ Vi+1 contains all where ∗ if (vi+1 + vi )T AT i bi > 0 ∗ if (vi+1 + vi )T AT i bi < 0 .
Moreover, by Corollary 3, for any such ̺i , there exists a constant gain ki = ki (̺i ), such that under the control u = kiT x, the closed-loop trajectory in Ki∗ steers vi to ̺i vi+1 . On the other hand, if bi 6= 0 and range(Ai ) ⊂ Bi , then ReachΣi (vi ) = (vi +Bi )∩Ki∗ . In this case, it easily follows that for some ζi ∈ R, the closed-loop trajectory starting at vi and under the feedback control u = ζi b∗i , is a straight v T b∗ line segment in Ki∗ that joins vi to vTi ib∗ vi+1 . Hence we set ̺i =
i+1 i
viT b∗ i . T vi+1 b∗ i
Lastly, if bi = 0, in view of Lemma 4, set Q µi τi that the collection . Since i∈I ξ i < 1, it follows Q ̺i = e {̺i , i ∈ I} may be selected such that i∈I ̺i < 1. Let γ˜ denote the segment of the closed-loop trajectory under Q ̺ a complete cycle. Clearly γ˜ steers v1 to to i∈I i v1 , and it easily follows that the closed-loop trajectory converges asymptotically to the origin. Since, by linear scaling every x ∈ R2∗ satisfies λx ∈ γ˜ for some λ > 0, it follows that the closed-loop system is asymptotically stable. Remark 3: As seen in Example 2, even if every pair (Ai , bi ) is controllable, the system might not be stabilizable by state feedback. This connects directly to the stability analysis. Despite the fact that the eigenvalues of the closed loop system Ai + bi kiT can be selected to have any negative values desired, thus making the coefficients αi as negative as desired, this process also affects the gains βi in a manner that might always result in an unstable system. R EFERENCES [1] M. Pachter, D. H. Jacobson, The stability of planar dynamical systems linear-in-cones, IEEE Trans. Automat. Control 26 (2) (1981) 587–590. [2] M. Pachter, D. H. Jacobson, Observability with a conic observation set, IEEE Trans. Automat. Control 24 (4) (1979) 632–633. [3] D. Liberzon, J. P. Hespanha, A. S. Morse, Stability of switched systems: a Lie-algebraic condition, Systems Control Lett. 37 (3) (1999) 117–122. [4] A. A. Agrachev, D. Liberzon, Lie-algebraic stability criteria for switched systems, SIAM J. Control Optim. 40 (1) (2001) 253–269. [5] A. Arapostathis, M. Broucke, Stability and controllability of planar, conewise linear systems, Systems Control Lett. 56 (12) (2007) 150– 158, hybrid control systems. [6] W. Boothby, A transitivity problem from control theory, J. of Differential Equations 17 (2) (1975) 296–307. [7] J.-I. Imura, A. van der Schaft, Characterization of well-posedness of piecewise-linear systems, IEEE Trans. Automat. Control 45 (9) (2000) 1600–1619. [8] A. Bacciotti, Some remarks on generalized solutions of discontinuous differential equations, Int. J. Pure Appl. Math. 10 (3) (2004) 257–266. [9] F. H. Clarke, Y. S. Ledyaev, E. D. Sontag, A. I. Subbotin, Asymptotic controllability implies feedback stabilization, IEEE Trans. Automat. Control 42 (10) (1997) 1394–1407. [10] A. F. Filippov, Differential equations whose right-hand side is discontinuous on intersecting surfaces, Differentsial′ nye Uravneniya 15 (10) (1979) 1814–1823. [11] A. F. Filippov, Differential equations with discontinuous righthand sides, Vol. 18 of Mathematics and its Applications (Soviet Series), Kluwer Academic Publishers Group, Dordrecht, 1988, translated from the Russian.
6273