Unified Frequency Domain Inequalities With Design Applications

Report 9 Downloads 21 Views
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 50, NO. 1, JANUARY 2005

41

Generalized KYP Lemma: Unified Frequency Domain Inequalities With Design Applications Tetsuya Iwasaki, Senior Member, IEEE, and Shinji Hara, Senior Member, IEEE

Abstract—The celebrated Kalman–Yakuboviˇc–Popov (KYP) lemma establishes the equivalence between a frequency domain inequality (FDI) and a linear matrix inequality, and has played one of the most fundamental roles in systems and control theory. This paper first develops a necessary and sufficient condition for an -procedure to be lossless, and uses the result to generalize the KYP lemma in two aspects—the frequency range and the class of systems—and to unify various existing versions by a single theorem. In particular, our result covers FDIs in finite frequency intervals for both continuous/discrete-time settings as opposed to the standard infinite frequency range. The class of systems for which FDIs are considered is no longer constrained to be proper, and nonproper transfer functions including polynomials can also be treated. We study implications of this generalization, and develop a proper interface between the basic result and various engineering applications. Specifically, it is shown that our result allows us to solve a certain class of system design problems with multiple specifications on the gain/phase properties in several frequency ranges. The method is illustrated by numerical design examples of digital filters and proportional-integral-derivative controllers. Index Terms—Control design, digital filter, frequency domain inequality, Kalman–Yakuboviˇc–Popov (KYP) lemma, linear matrix inequality (LMI).

I. INTRODUCTION

O

NE OF THE most fundamental results in the field of dynamical systems analysis, feedback control, and signal processing, is the Kalman–Yakuboviˇc–Popov (KYP) lemma [1]–[3]. Various properties of dynamical systems can be characterized by a set of inequality constraints in the frequency domain. The KYP lemma establishes equivalence between such frequency domain inequality (FDI) for a transfer function and a linear matrix inequality (LMI) for its state space realization. The basic roles of the KYP lemma are two fold: it provides 1) insights into analytical approaches to systems theory, and 2) a framework for numerical approaches to systems analysis and synthesis.

Manuscript received September 4, 2003; revised May 25, 2004 and September 11, 2004. Recommended by Associate Editor Y. Ohta. This work was supported in part by the National Science Foundation under Grant 0237708, by The Ministry of Education, Science, Sport, and Culture, Japan, under Grant 14550439, by CREST of the Japan Science and Technology Agency (JST), and by the 21st Century COE Program on Information Science and Technology Strategic Core. T. Iwasaki is with the Department of Mechanical and Aerospace Engineering, University of Virginia, Charlottesville, VA 22904-4746 USA (e-mail: [email protected]). S. Hara is with the Department of Information Physics and Computing, Graduate School of Information Science and Engineering, The University of Tokyo, Tokyo 113-8656, Japan (e-mail: [email protected]). Digital Object Identifier 10.1109/TAC.2004.840475

The KYP lemma [3] states that, given matrices Hermitian matrix , the FDI

,

, and a

(1) holds for all

if and only if the LMI (2)

admits a Hermitian solution . Thus, the infinitely many inequalities (1) parametrized by can be checked by solving the finite-dimensional convex feasibility problem (2). Appropriate choices of in (1) allows us to represent various system properties including positive-realness and bounded-realness. While the KYP lemma has been a major machinery for developing systems theory, it is not completely compatible with practical requirements. In particular, each design specification is often given not for the entire frequency range but rather for a certain frequency range of relevance. For instance, a closed-loop shaping control design typically requires small sensitivity in a low frequency range and small complementary sensitivity in a high frequency range. Thus a set of specifications would generally consists of different requirements in various frequency ranges. On the other hand, the standard KYP lemma treats FDIs for the entire frequency range only. The current state of the art for fixing the incompatibility is to introduce the so-called weighting functions. A low/band/highpass filter would be added to the system in series as a weight that emphasizes a particular frequency range and then the design parameters are chosen such that the weighted system norm is small. The weighting method has proven useful in practice, but there remains some room for improvement. First, the additional weights tend to increase the system complexity (e.g., controller order), and the amount of increased complexity is positively correlated with the complexity of the weights. Second, the process of selecting appropriate weights can be time-consuming, especially when the designer has to shoot for a good tradeoff between the complexity of the weights and the accuracy in capturing desired specifications. We remark that some of these deficiencies may be addressed by recent developments for new characterizations of disturbance signals [4], [5]. An alternative approach is to grid the frequency axis. In this case, infinitely many FDIs are approximated by a finite number of FDIs at selected frequency points. This approach has a practical significance especially when the system is well damped and the frequency response (after the design) is expected to be “smooth” (i.e., no sharp peaks). The resulting computational

0018-9286/$20.00 © 2005 IEEE

42

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 50, NO. 1, JANUARY 2005

burden often seems moderate. However, it is difficult to determine a priori how fine the grid should be to achieve a certain performance. The method also suffers from the lack of a rigorous performance guarantee in the design process as the violation of the specifications may occur between grid points. Another approach that avoids both weighting functions and frequency gridding is to generalize the fundamental machinery, the KYP lemma, in such a way that FDIs in finite frequency ranges can be treated directly. Although the time-consuming design iterations may not be completely eliminated with this approach, it is expected that the ability to directly deal with finite frequency ranges without weighting functions would provide a more user-friendly platform for systems design. There are several results in the literature along this line. The finite frequency , KYP lemma obtained in [6] states that, given a scalar matrices , , and a Hermitian matrix , the FDI (1) holds in the low frequency range if and only if the LMI

(3) admits Hermitian solutions and . The nonstrict inequality version of this result has been obtained in [7] in the context of integrated design of structure/control systems. Other generalizations of the KYP lemma include those in [8]–[10] where FDIs for polynomials are considered in the discrete-time setting. Specifically, algebraic conditions are given to characterize polynomials that are positive on an arc of the unit circle in the complex plane. Some of these results are developed for the purpose of digital filter design. The main objective of this paper is to further generalize the KYP lemma and to unify all the previous extensions previously mentioned by a single theorem. There are two basic aspects in our generalization—the frequency range and the system class. In particular, we consider the frequency intervals characterized by two quadratic forms of the frequency variable . This characterization captures curve(s) on the complex plane and encompasses low/middle/high frequency conditions for both continuous-time and discrete-time systems. Thus, the result includes the previous results on the low/middle frequency conditions for continuous-time systems [7] as special cases, and adds new contributions to the cases involving high frequency conditions and discrete-time systems. The system class is generalized by, roughly speaking, replacing the term in (1) with a new term of the form . It turns out that this allows us to capture FDIs for descriptor systems as well as polynomials. The generalized KYP lemma will be obtained within the framework of the -procedure [11], [12] that has been extensively used in the systems and control literature (e.g. [13]–[16]). It converts an inequality condition with multiple constraints to an unconstrained inequality condition with multipliers. The -procedure is conservative in general, i.e., the latter condition implies the former, but the converse is not always true. Fradkov showed in 1973 that the -procedure with scalar multipliers is lossless (i.e., nonconservative) if and only if a certain rank-one property holds for an associated separating hyperplane [17].

This type of rank-one property has been used as a sufficient condition for the -procedure to be exact in a more general setting of matrix-valued multipliers [3], [6], [18]–[21]. In this paper, we will prove necessity of the rank-one property, providing a necessary and sufficient condition for the -procedure to be lossless in the general setting. Our result on the -procedure will be used to generalize the KYP lemma, but it will also be of independent interest as a basis for systems analysis in general. Another contribution of the paper is to develop a useful interface between the generalized KYP lemma and various engineering applications. We will show that the design specifications encompassed by our result can be summarized, in the case of single-input–single-output (SISO) transfer function designs, as follows. The Nyquist plot of the transfer function within a prescribed frequency interval must lie in the intersection of prescribed conic sections on the complex plane. It will also be shown that several classes of important engineering problems can be formulated in terms of such specifications, and can be solved exactly through LMI optimizations. Finally, design examples of digital filters and proportional-integral-derivative (PID) controllers will illustrate the procedures and advantages of our approach in comparison with existing ones. This paper is organized as follows. We will first discuss in Section II system design problems that are naturally described by FDI specifications in various frequency ranges. These problems motivate our generalization of the KYP lemma to treat finite frequency ranges. Section III develops an extension of the -procedure and obtains a necessary and sufficient condition for losslessness. Section IV gives a characterization of general frequency ranges (curves on the complex plane) in terms of quadratic forms of the frequency variable. Section V presents our main results that generalize the standard KYP lemma. We will then show in Section VI how various FDI specifications for system design can be described within our framework. Finally, Section VII shows what classes of design problems can be solved via convex optimizations, where analytical discussions will be followed by design examples. We use the following notation. For a matrix , its transpose, complex conjugate transpose, and the Moore–Penrose inverse , , and , respectively. The real and are denoted by are denoted by and . The imaginary parts of stands for the set of Hermitian matrices. For symbol , inequalities and a matrix denote positive (semi)definiteness and negative (semi)definiteis deness, respectively. The set of matrices means the Kronoted by . For matrices and , and , a function necker product. For is defined by

(4) The convex hull and the interior of a set are denoted by and , respectively. Given a positive integer , let be the . set of nonnegative integers up to , i.e., Finally, denotes the set of positive integers.

IWASAKI AND HARA: GENERALIZED KYP LEMMA

43

II. MOTIVATION: SYSTEM DESIGN We will first motivate our research through several examples of design problems for which our generalized KYP lemma is naturally suited and can be useful. Later in the paper, some of these problems will be discussed in more detail with solution procedures and numerical design examples. Digital filter design: The problem is to find a stable SISO transfer function satisfying a set of frequency domain specifications. Typical design requirements for band-pass filters are given by the following Chebyshev approximation setting (see, e.g., [22]–[25]): : : : : all

(5)

where and is a given function that . An has desired gain/phase properties in the pass-band would have the unity magnitude and the ideal response linear phase in the pass-band, i.e., . The first three conditions specify the stop-bands and the pass-band, while the last condition is added to suppress overshoot in the transition and . bands Sensitivity-shaping: This is a typical control design with specifications on the closed-loop transfer functions [26], [27]. and a controller , the sensitivity and the For a plant and are defined complementary sensitivity functions by

Fig. 1.

Open-loop shaping specifications.

the low frequency range for the sensitivity reduction. The servo bandwidth requirement in the middle frequency range might be represented by an elliptic constraint as shown in the figure, where we may maximize the bandwidth. The two straight lines provide gain/phase lying to the right of the point stability margins, and the small circle centered at the origin relates to the roll-off requirement in the high frequency range for robust stability. Structure/control design integration: Control performance of mechanical structures can be significantly enhanced if the designs of the structure and the controller are integrated [7], [28]–[30]. It has been shown that the finite frequency positivereal (FFPR) property in the low frequency range (6)

The sensitivity-shaping problem typically consists of the following type of requirements: : : : all : all where . The first constraint may represent good tracking of a reference signal with spectral contents in the low frequency range, while the second may be interpreted as a robustness requirement against unmodeled high frequency dynamics of the plant. The last two constraints tend to avoid oscillatory time responses. Open-loop shaping: This is a classical control design problem of determining the controller parameters to meet specifications on the open-loop transfer function. Given a , a set of specifications on marginally stable SISO plant is given in terms of the Nyquist plot of the controller the open-loop transfer function . Fig. 1 shows typical design requirements, where the shaded regions indicate where the Nyquist plot should lie in various frequency at the lower right ranges. The half plane constraint on part of the figure corresponds to the high-gain requirement in

is an important requirement for mechanical structure design to guarantee the existence of controllers that achieve high servo-bandwidth [7], [31], where is the transfer function of the mechanical system to be designed. These references demonstrated that structures of practical significance can be designed to achieve the FFPR property by solving LMI feasibility problem with the aid of an existing version of the finite frequency KYP lemma [6], [7]. In particular, the method has been applied to a shape design of a swing arm for hard disk drives [7] and a smart arm design using piezo-electric film [31]. III.

-PROCEDURE

We will first develop a generalized version of the -procedure that converts a constrained inequality to an unconstrained inequality with multiplier(s) [12], [17], [32], [33]. The result is instrumental to the proof of the generalized KYP lemma to be stated later, and will be of independent interest as a tool for developing systems theory. A. Classical Form and Its Generalization The classic version of the -procedure [12], [33] is the following. Given , , we have the equivalence such that such that

44

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 50, NO. 1, JANUARY 2005

where the regularity, version

, is assumed. The strict inequality such that

such that also holds, and for this equivalence the regularity assumption is is negative semidefinite in particular, the not required. When statement reduces to the Finsler’s theorem [34]. The -procedure is to replace the condition on the left of “ ” by the one on the right. It may be difficult to check the former condition directly because the inequalities impose nonconvex constraints on in general. On the other hand, the latter condition is easier to verify as it is a search for the scalar multiplier satisfying convex constraints. To generalize the classical -procedures, let us rewrite them with different notation. First note that the set of matrix-valued multipliers associated with the classical -procedures can be defined as

a) admissible if it is a nonempty closed convex cone2 and ; ; b) regular if c) rank-one separable if . It can readily be verified by a separation theorem (Lemma is 11 in Appendix II) that a nonempty closed convex cone admissible if and only if the set is nonempty. Note that is the set of the hyperplanes that separate from , motivating the term “rank-one separable” in c). The definition of regularity is consistent with the standard notion. For example, the set in . It will turn out that regularity (7) is regular if and only if corresponds to controllability when we consider a special class that is useful for systems analysis. of Some examples of admissible, regular, rank-one separable in (7) with and sets include

(11)

(7) Also note that a general Hermitian form can be expressed as where satisfies and . Then alternative expressions for the aforementioned -procedures are given by1 (8) (9) where

is a set specified by

as follows:

rank

(10)

on in For brevity, we will omit the dependence of and the development below. Clearly, the -procedures are completely specified by the set through (8)–(10). We consider a generalization of the -procedure to the case where is an arbitrary subset of Hermitian matrices, rather than the specific set in (7). In this case, the equivalence in (8) or (9) no longer holds in general. In fact, it is easy to verify that the implication “ ” always holds, but the converse “ ” may or may not, depending upon and . The -procedure is said to be lossless if not only “ ” but also “ ” hold. B. Characterization of Lossless -Procedure The objective of this section is to show under what condithe associated -procedure is lossless regardless of tion on . The previous result in [6] provided a sufficient condition for the strict version of the -procedure (9) to be lossless. Our result below shows that the previous condition turns out to be also necessary, and is valid for the nonstrict case (8) as well under a regularity assumption. To this end, let us introduce the following. is said to be Definition 1: A set

tr(2S )  0

tr(2 )  0

1Notation means that S holds for all S type of notation will be used throughout this section.

2S

It is straightforward to verify that these sets are admissible and regular. The rank-one separability has been proven for (7) in in [3], [35], and for in [6]. The set re[12], for lates to the entire frequency range in the continuous-time setassociated with via (10) is ting. Specifically, the set with , given by matrices of the form for some whenever [3]. On the other hand, the set relates to a finite frequency range by and [6], [19]. The multiplier associated with the ( , ) scaling that gives a real upper bound [36] has been shown to be rank-one . Other results in separable [19], and is closely related to the literature (e.g. [18] and [37]) may also be interpreted within the framework of rank-one separability. The following result provides a necessary and sufficient condition for the S-procedure to be lossless. Theorem 1 (S-procedure): Let an admissible set be given and define by (10). Then, the strict S-procedure is , if and only if lossless, i.e., (9) holds, for an arbitrary is rank-one separable. Moreover, assuming that is regular, the nonstrict -procedure is lossless, i.e., (8) holds, for any , if and only if is rank-one separable. Proof: We shall prove the result for the nonstrict inequality case. The strict inequality case can be proven similarly. is rank-one separable. Let us first prove sufficiency. Suppose Note that the implication “ ” in (8) is obvious. To show the . Then, from Lemma converse, suppose that guarantees that there exists 10 in Appendix II, regularity of such that has no solution . From an a separating hyperplane argument (Lemma 11 in Appendix II), such that there exists a nonzero

Since

. This 2A

set

is a convex cone containing the origin, we have

M is a cone if M 2 M implies  M 2 M for all   0.

IWASAKI AND HARA: GENERALIZED KYP LEMMA

45

Thus, trices

. Since is rank-one separable, there exist masuch that . Note that implies that must be true for some index . This means that the condition on the left in (8) does not hold, and we conclude that “ ” holds in (8). We now prove necessity. The main idea follows a result by Fradkov [17, Th. 19] that proved necessity for a different case is of the type given by (7). Suppose is not rank-one where such that and separable. Then there exists because . Note that the latter condition is disjoint from . Since means that the point is a closed convex cone and , the infimum of the disand is strictly positive. Consequently, tance between a separation theorem [38, Th. 11.4] infers that the two sets are strongly separable by a hyperplane, i.e., there exists a Hermitian matrix such that and

Clearly, the second condition implies . On the other hand, the first condition implies that because otherwise we have the following contradiction: For such that

where we noted that , and due to and . Thus, we have shown that if is not rank-one separable, then there exists for which the left-hand side (LHS) of (8) holds but the right-hand side (RHS) does not. Hence, for such , (8) does not hold, proving necessity. Theorem 1 shows that the rank-one separability is necessary and sufficient for the -procedure to be lossless regardless of the choice of . It should be noted that the rank-one separability may not be necessary for a particular choice of . Indeed, a recent result by Ebihara et al. [39] provides a sufficient condition for losslessness when is negative semidefinite, and their condition is weaker than the notion of rank-one separability. A class of rank-one separable sets can be generated by a transformation of a particular rank-one separable set as follows. The result will be used to derive a generalized KYP lemma in a later section. be a rank-one separable set. Then Lemma 1: Let the set is rank-one separable for any matrix and subset of positive–semidefinite matrices containing the origin. be a nonzero positive–semidefinite Proof: Let matrix such that . Since , we have . Since is rank-one separable, there exist such that an integer and vectors

and . By [34, Th. 2.3.1], the former implies existence of such that

Then, noting that

hold for all

, it can be verified that

and

IV. FREQUENCY RANGE CHARACTERIZATION One of the key developments in this paper is a unified characterization of finite frequency ranges for both continuous-time and discrete-time systems. In general, a frequency range is visualized as a curve (or curves) on the complex plane. For instance, a low frequency range in the continuous-time setting is a line segment on the imaginary axis containing the origin. A frequency range in the discrete-time setting would be an arc of the unit circle. The objective of this section is to propose a general framework for representing various curves (frequency ranges), which will later be used for characterizing FDIs. Let us first clarify what we mean by curves on the complex plane. Definition 2: A curve on the complex plane is a collection of continuously parametrized by infinitely many points for where , and .A is said to represent a curve (or set of complex numbers curves) if it is a union of a finite number of curve(s). We consider the following set of complex numbers that represents a certain class of curves: (12) are given matrices and the function is where , defined in (4). The dependence of on and may be made when no confusion implicit and is used in place of should arise. We will explain what type of curves can be repthrough what choices of and . The resented by has been extensively studied in the literaspecial case ture (e.g. [40]) with the notion of “ -stability”, and the previous work provides a basis for the analysis in the next paragraph. is the intersection of and Note that the set . It can readily be verified that the set represents . If , then it is a curve if and only if an empty set, the entire complex plane, or a set with a single , then it is either a circle or a straight line. element. If Conversely, every circle or a straight line can be represented by with such that . In particular

defines the circle of radius Without loss of generality, we may assume that to be zero. Define lowing for some be a full rank factorization. Then . By [34, Th. 2.3.8], there exists such that

. This completes the proof.

with center at

, and

by aland defines a straight line with the normal vector . The set is a region on the complex plane with the boundary

46

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 50, NO. 1, JANUARY 2005

whenever ; it is the inside or outside of a , then it is the entire complex circle, or a half plane. If and , it is plane. The remaining case is trivial; if either an empty set or a set with a single element. represents curve(s) only if We now see that the set and either or , in which case, it can be partial or whole segment(s) of a circle or a straight line. When the necessary condition is satisfied, it is possible that the and is empty or intersection of the two sets a single point, rendering the condition not sufficient. To state to represent a necessary and sufficient condition for curve(s), let us introduce a simultaneous matrix factorization result. A proof is given in Appendix I. Lemma 2: Let , be given. Suppose . Then there exists a common congruence transformation such that (13) (14) where , , and . In particular, and can . be ordered to satisfy With the factorizations in (13), the essential part of the set may be captured by the canonical set . Specifically, the former can be generated by the bilinear transformation of the latter. The following lemma formally states and proves this fact in the general setting where , are arbitrary. and nonsingular be Lemma 3: Let , by given. Define scalars , , , and and function (15) Then the following holds true:

Conversely, let that

be an element of the set on the LHS. Note

Hence, is well defined and it can be verified that . Since then follows from the previous identity, we see that belongs to the set on the RHS of (16). with and deLet us now examine the set fined in (14). Note that holds if and only if for some . Moreover, for such , we have . Hence, the set is the entire imaginary if , and is a partial segment of if axis ; otherwise, it is either empty or a single point. Conis conceived as the image of (a segsequently, the set ment of) the imaginary axis mapped through the bilinear transis an invertible, continuous mapping alformation. Since most everywhere on , represents curve(s) if and only does so. if The following result summarizes the aforementioned development. Its formal proof is omitted for brevity. be given and define the set Proposition 1: Let , by (12). Then the set represents curve(s) on the complex plane if and only if the following two conditions hold: ; • or ; • either where and are defined by the factorization in (13). In this case, there are two possibilities. I) is a circle or a line specified by since . is a partial segment (or II) segments) of a circle or a line specified by . By an appropriate choice of and in (12), the set can be specialized to define a certain range of the frequency variable . For the continuous-time setting, we have

(16) Proof: Let be an element of the set on the RHS of (16). such that and Then there exists . Note that the following identity holds for any and if :

From this identity and and have

, it readily follows that holds and, hence, we . Moreover

due to nonsingularity of . Hence, we see that seton the LHS of (16).

belongs to the

where is a subset of real numbers specified by an additional choice of , for instance, as follows:

where , and LF, HF, and MF stand for low, high, and middle frequency ranges, respectively. Similarly, for the discrete-time setting, we have

IWASAKI AND HARA: GENERALIZED KYP LEMMA

where

47

is a subset of real numbers specified by

where

, , and . Detailed derivations of these tables are omitted here due to space limitation, but can be found in a conference version of this paper [41]. The set can also capture a portion of the real axis as

This variation is not fully explored in this paper but will be useful for robustness analysis with respect to real parametric uncertainties (e.g., [42]). It is worth noting that an interval on the real axis can also be treated by the unit circle [43] through the Bliman’s transformation . V. GENERALIZED KYP LEMMA A. Main Theorem This section derives a generalized KYP lemma using the -procedure in (9) or (8); an FDI will be specified by the LHS of each statement, and the RHS then provides an LMI captures condition for satisfaction of the FDI. In particular, the input–output graph of a system in a certain frequency range. To elaborate on this point, let us first discuss the standard KYP would be given by lemma. For the FDI in (1), the set

and to show that the as in (10) by an appropriate choice of possesses the properties in Definition 1 so that the chosen set -procedure is lossless. We will take these steps in the “canonical coordinates” via the bilinear transformation. In particular, can be reduced we first show that the case with general through the simultaneous factorization in to that with (13), and then take the aforementioned two steps for the latter case. and a nonsingular matrix Lemma 4: Let , be given and define , by (13). Consider in in (12), and in (19). Suppose (18), represents curve(s). The following conditions on a given vector are equivalent. holds for some . i) holds for some . ii) Proof: When statement i) holds, there are three possible , b) , , or c) , , cases: a) where the entries of matrix are defined by (15). Hence, in view is of Lemma 3, we see that i) holds if and only if: a) , b) , , unbounded and and , or c) for some such . Through some algebraic manipulations and using that Lemma 13 in Appendix II, it can be verified that the condition is , is unbounded, and equivalent to: a) ; or , , and , b) , is unbounded, and , or for some such that . c) We now see that the condition is equivalent to for some . The previous lemma allows us to convert the -procedure to a standard form. In particular for some is equivalent to

This set can also be characterized (in the spirit of [44]) as for some (17) where

and (18)

We now generalize this characterization of the FDI with respect to the frequency range. Recall that the set in (12) captures finite frequency ranges of our interest. Hence, we consider the general frequency range in (12) for the definition of in (17) where is defined as (if is bounded) (otherwise).

(19)

Finally, the system description is also generalized by considering a completely arbitrary matrix rather than the one with the structure as in (18). It can now be seen that the main technical steps to arrive at in (17) the generalized KYP lemma are to express the set

where , and similarly for the nonstrict inequality case. Note that, if the factorization in (13) is used, the is (a segment of) the imaginary axis, in which set case some prior results already exist [3], [6], [7], [45]. Here, we provide the aforementioned two steps by two lemmas. and , be given Lemma 5: Let by (19) such that in (12) represents curves. Define and and (18), respectively. Then, the set defined in (17) can be characterized by (10) with

(20) Proof: We will first prove that the following two conditions are equivalent: for some ; i) for all , , ; ii) and are defined in (14). First note that i) holds where holds for some such if and only if either a) that , or b) is unbounded (i.e. )

48

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 50, NO. 1, JANUARY 2005

and Since

, where , are defined by . represents curve(s), there are two possibilities: and . In the former case, [3, Cor. 4] proves . It is then straightforward that i) is equivalent to to show that this condition is further equivalent to ii). In the latter case, [6, Lemma 2] can be used to show the equivalence between i) and ii). be defined by (17), and be defined to be in Now, let (10) with (20). Then, for a nonzero vector for some for some for all

where the first and fourth equivalences easily follow from the definitions, and the second and third hold due to Lemma 4 and ii), respectively. the previous equivalence i) is a closed convex cone. When , the set Clearly, is nonempty and hence is admissible. From Lemma 2, we have

is regular and the matrix is a minimal realb) The set in the sense that ization of the set

(21) if and only if (22) We are now ready to state and prove a general theorem. , Theorem 2 (Generalized KYP): Let matrices , and , be given and define and by (12) and (19), respectively. Suppose represents curves on the null space of where the complex plane. Denote by is defined in (18). The following statements are equivalent. . i) ii) There exist , such that and (23) Moreover, if the rank condition in (22) is satisfied, then the following statements are equivalent. . i) such that and ii) There exist , (24)

where case, defining

or

can be assumed. In the former

the set can be characterized as with defined in (11). Since is rank-one separable, it then follows from Lemma 1 that is rank-one separable. Similarly, , the set can be characterized as in the latter case for some matrix and for a subset of positive semidefinite matrices. Again, Lemma 1 infers that is rank-one separable. These arguments are summarized in statement a) of the following lemma. Statement b) gives a condition for regularity, and is proven in Appendix III. Lemma 6: Let matrices , and ( , ) be given such that in (12) represents curves. Define by (20) and the matrix-valued mapping by (18). the set The following statements hold true. a) The set is admissible and rank-one separable.

Proof: We will prove the strict inequality case. The nonstrict inequality case can be shown similarly. Note that (i) holds if and only if holds where is defined in (17). can be characterized by (10) with From Lemma 5, the set in (20). By Lemma 6, the set is admissible and rank-one separable. Hence, from the generalized -procedure (Theorem 1), satisfying condition is equivalent to the existence of , or equivalently, the existence of , such and (23) hold. Since the inequality in (23) is strict, that we can strengthen the positivity of as without loss of generality. In view of the developments in Section IV, the term in statement ii) of Theorem 2 can be specialized to the low/middle/high frequency ranges in the continuous/discretetime settings as shown in the equation at the bottom of the page. When , , , and are all real matrices, and in statement ii) of Theorem 2 can be restricted to be real without loss of generality. This follows basically from the fact that the real part of a complex Hermitian positive–definite matrix is positive definite. Note that, if the frequency region is not symmetric about the real axis, then one needs to search for complex and

IWASAKI AND HARA: GENERALIZED KYP LEMMA

49

even when and are real. In such case, the LMI in complex variables can be converted to an LMI of larger dimension in real variables through the following equivalence:

is controllable. Let be the set of eigenvalues b) the pair of in . Then, the following statements are equivalent. , we have i) For each

ii) There exist where

and

,

such that

and

are real square matrices. (27)

B. Specific Extensions of the KYP Lemma The general result in Theorem 2 can be specialized to several versions of the KYP lemma. There are two aspects of specialization: the frequency range and the underlying system. The former has been briefly discussed at the end of the previous section. We will devote this section to the latter. In particular, we consider FDIs for descriptor and state space systems as well as for polynomial functions. Theorem 3 (Descriptor, Strict Inequality Version): Let ma, , , , and , trices , be given and define and by (12) and (19), respectively. Suppose that: a) represents curves on the complex plane, b) for all , and c) either is nonsingular or is bounded. Then, the following statements are equivalent. i) For , we have for all . such that and ii) There exist , (25) Proof: The result follows from Theorem 2 by choosing (26) If is bounded, then the result is immediate by noting that the is given by for all null space of under the invertibility assumption on . If is unbounded, coincides with the null we need to make sure that the above at as well. This is indeed the case if is space of nonsingular. In Theorem 3, statement ii) implies that for , provided that the upper left block of , denoted all , is positive semidefinite. To see this, let us assume ii) and by for some . Let be a vector in the null space of , i.e., . Denote by the upper block of the matrix on the LHS of (25). Then left

which cannot be true as every terms are nonnegative. By contrafor any diction, we conclude that ii) implies . Based on this observation, Theorem 3 can be modified as follows: Replace supposition b) with , and add the for all to statement i). condition Theorem 4: (State–space, nonstrict inequality version) Let , , , and , matrices be given and define and by (12) and (19), respectively. Suppose that a) represents curves on the complex plane, and

Proof: The result basically follows from the nonstrict inequality case of Theorem 2 by choosing as in (26) with and . Note that the rank condition in (22) translates to and nonsingularity of when . controllability of The only crucial step in this proof is to show that statement i) is equivalent to (28) where is the null space of Theorem 3, can be given by Hence, i) is equivalent to

. As discussed in the proof of if .

By Lemma 12 in Appendix II, this condition is equivalent to (28) under the controllability assumption. Theorems 3 and 4 capture certain properties of dynamical systems expressed in terms of state space, possibly descriptor, realizations. The results include, as special cases, the standard version of the KYP lemma [3], its finite frequency extensions [6], [7], and a unified (continuous/discrete-time) FDI charac) [46]. A terization for the entire frequency range (i.e., and may also be special case of Theorem 3 with obtained through an application of a recent, independent result by Scherer [21]. The following result characterizes certain properties of polynomials, and is particularly useful for the design of digital filters as discussed later. Theorem 5 (Polynomial version): Let matrices , , and , be given and define by (12). Suppose that represents curves on the complex plane. Define (29) Then, the following statements are equivalent. i) holds for all . such that and ii) There exist ,

(30) where .

,

,

, and

50

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 50, NO. 1, JANUARY 2005

The equivalence also holds when the three inequalities are replaced by strict inequalities with the modification that condiis added to statement i) if is unbounded, tion where is the upper left block of . , define for Proof: For a vector . Then and hold where and . Noting that , we see that, for a fixed , condition holds if and only if for all

such that

in different frequency ranges this type, with different sets where . We will address such design problems in the next section. The objective of this section is to show what classes of can be treated within the framework of our generalized KYP lemma. Recall that the FDIs arising in various versions of the KYP lemma (theorems in Section V) are given in terms of quadratic forms. Thus, a set can be completely captured within our framework if it can be described by a quadratic form. The idea is and , a transfer function to construct, from given and a Hermitian matrix such that (31)

If is bounded, the result then directly follows from Theorem 2. When is unbounded, Theorem 2 infers equivalence of i) and ii) if condition for all

such that

for each fixed frequency. If is a polynomial, then the FDI in a given frequency range can be characterized by an LMI using Theorem 5. If it is a transfer function , then the FDI can be expressed as

is added to statement i). This additional condition is equivalent . We claim that this is already implied by the to original statement i) when is unbounded. To see this, note that

This implies that has a positive eigenvalue for with a sufficiently large magnitude, whenever . This completes the proof for the nonstrict inequality case. Finally, the strict inequality case can be proved in the same is not immanner. Note, however, that condition for all in general and, hence, plied by has to be added to statement i). Versions of positive-real lemma for polynomials have been obtained in the standard (unrestricted) frequency setting [47]–[49]. However, results in the restricted frequency setting are limited. References [8]–[10] gave characterizations of scalar-valued quasipolynomials that are nonnegative on the unit circle. Our results extend and unify these prior results to the case of matrix-valued positive definite polynomials in both continuous-time and discrete-time settings with both strict and nonstrict inequality cases. In particular, a convex parametrization of positive quasipolynomials as in [9], [10] can be obtained from Theorem 5; see [50] for details.

VI. FREQUENCY RESPONSE SPECIFICATIONS Consider a transfer function that depends on the deinclude the opensign parameter vector . Examples of , the closed-loop sensitivity funcloop transfer function tions and , and the digital filter as discussed in Section II. The design problem is to find such that

where is a prescribed subset of the complex matrices. More generally, may be required to satisfy multiple constraints of

(32) allows us to treat the FDI Thus, the appropriate choice of within the framework of Theorem 3 (or Theorem 4 when ). Hence, the question is how to construct and so that (31) holds. We will answer this for different classes of in the following sections. A. Half Plane and Circular Regions The simplest choice of is characterized by the FDI in (31) is

. In this case, the set

(33) This set captures some fundamental properties. For instance, the small gain condition is given by

and the positive-real condition is given by

To discuss a more general specification on the Nyquist represents a plot, let us consider the case where single-input–single-output (SISO) system. In this case, for all means that the segment of the lies in the Nyquist plot specified by the frequency range region on the complex plane. As discussed in Section IV, the , region defined by (33) is nontrivial if and only if in which case, it is given by a half plane or a region inside or outside of a circle. Clearly, the small gain (disk) and the positive-real (right-half plane) conditions are characterized by special cases of such regions.

IWASAKI AND HARA: GENERALIZED KYP LEMMA

51

B. General Conic Sections

TABLE I 10 CASES OF VARIOUS REGIONS

is a scalar (SISO) Next we consider the case where transfer function and is a general conic section on the complex plane.3 Recall that a conic section can be characterized by (34) . The following lemma shows that for some real matrix this characterization can be converted to another which is compatible with our description of the frequency domain inequality. be given and consider Lemma 7: Let a real matrix the set of complex numbers defined in (34). This set can also be characterized by

where

is defined by

Proof: Note that

Then, it is straightforward to see that

from which the result follows. Lemma 7 allows us to establish the equivalence in (31) when is a conic section by choosing appropriate and . In parbelongs to the conic section ticular, the transfer function defined in (34) if and only if is chosen as shown in Lemma 7 and is specified as

where

is defined by condition for all for the continuous-time case for the discrete-time case. Thus, for and , the conic section condition can each frequency with the above augmentation, and be converted to the latter can further be converted to an LMI via theorems in Section V. The conic sections captured by (34) are summarized later. The proof is omitted for brevity, and can be found in [50]. be given and conProposition 2: Let a real matrix sider the set in (34). Partition as , e.g.,

Fig. 2.

Essential regions for 4 .

which is parametrized by a vector

satisfying

In particular, it is obtained from one of the ten regions in the -plane (see Table I and Fig. 2) through a rotation by and a . shift by VII. APPLICATIONS TO SYSTEM DESIGN

and denote the spectral decomposition of by where and . Then, the set is nontrivial and has an interior if and only if is indefinite. In this case, is given by the set of such that

This section is devoted to synthesis problems in engineering applications that can be solved using the LMI characterization of the FDI in the generalized KYP lemma. A. Basic Idea for Synthesis

3The developments below can be extended for multiple-input–multiple-output (MIMO) square plants, but the physical meaning is not completely clear.

From the discussion in the previous section, a general synthesis problem can be formulated as follows. For each , let transfer function , Hermitian matrix , and frequency range be given where depends on the

52

parameter vector for some ,

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 50, NO. 1, JANUARY 2005

to be designed, and is specified by (12) . The problem is to find such that (35)

for all . We show how the problem can be solved exactly under certain assumptions. For brevity of exposition, let , i.e., we have just one specification. us consider the case The case with multiple specifications can be handled similarly. In the sequel, we will drop the subscript “ ” in (35). A standing assumption is that the design parameter vector appears affinely in . If is a polynomial, then its is a proper rational coefficients are affine functions of . If transfer function, then it has a state–space realization where and are affine functions of while and are constant matrices. Another assumption is such that is that the weighting matrix (36) where and are the numbers of inputs and outputs of . This condition makes the set of satisfying convex. Note in particular that, when , the design specification (35) defines a convex region in the complex is desired to lie. This framework capplane in which tures such standard specifications as the bounded-real and positive-real properties. Under these assumptions, the condition in (35) is clearly convex in terms of the design parameter . Furthermore, theorems in Section V can be used to reduce the problem to an LMI. We will explicitly show how this can be done using Theorem is a proper transfer function. Other 4 for the case where is a polynomial or a nonproper (descriptor) cases where transfer function can be treated similarly using Theorem 5 or 3, except that the inequality in (35) has to be made strict in the latter case. Lemma 8: For a transfer function , the condition , holds if and only if such that there exist Hermitian matrices and (37) holds, where

provided (36) holds. Proof: This result follows directly from Theorem 4 with in (32) and the Schur complement. When and are affine functions of the design parameter vector , (37) defines an LMI in terms of the variables , , and . Therefore, can be computed via convex programming.

The reduction of the synthesis problem (35) to LMIs is rather straightforward once the proper assumptions are imposed. Attention must be paid, however, to when the assumptions are satisfied, and how they can be enforced through reparametrization of the design parameters. We will discuss these points for several engineering problems in the sequel. Structure/control design integration: The specification is given by the FFPR condition in (6), which can be converted to an LMI via Theorem 4. The design parameters would apfor certain problem settings, pear only in the numerator of including actuator or sensor placement. The design space may is affine in the new design then be reparametrized such that parameter vector . In this case, the design process is reduced to an LMI problem, which can be solved via semidefinite programming. This situation occurs in some practical design problems [7], [31]. Open-loop shaping: The control design via open-loop shaping discussed in Section II falls into the aforementioned framework if the poles of the controller are fixed a priori. An important practical problem that naturally fits in here is the PID control design. The controller transfer function is described by

(38) is a small parameter introduced to approximate where the differentiator by a proper transfer function. If is fixed, all the design parameters ( , , and ) appear linearly in the open-loop transfer function . Various specifications on the Nyquist plot of may be expressed as confor all where straints of the form is a frequency interval and is the corresponding convex region on the complex plane such as a disk, half plane, and a conic section. The idea described in Section VI can be used to in terms of quadratic forms of some (possibly represent as in (35). The design paramaugmented) transfer function eters will enter linearly and, hence, the PID controller can be designed by solving an LMI as in Lemma 8. Closed-loop control design: This is a fundamental problem of designing a controller to satisfy various constraints on the , including the sensiclosed-loop transfer functions tivity-shaping problem discussed in Section II as a special case. With the Youla parametrization of all stabilizing controllers [51], each closed-loop transfer function depends on the free in an affine manner. Thus the search for is parameter a convex problem, provided the constraints are convex in terms of the closed-loop transfer functions [52]. Since the infinite dimensionality of the parameter space is problematic, one often approximates the space by a finite dimensional space spanned by a selected set of basis functions. The problem then becomes the search for the coefficients of the basis functions, which can be converted to an LMI problem within the above framework satisfying if the specifications are given by (35) with each (36). Digital filter design: This is another problem discussed in Section II. Let us first consider the finite impulse response (FIR) filter synthesis problem. The denominator of the FIR filter

IWASAKI AND HARA: GENERALIZED KYP LEMMA

is fixed to

53

, and a state–space

realization of (39) may be given by the pair form and

in the controllable canonical

where are the coefficients of . Note that and are fixed matrices and the design parameters appear linearly in and . Moreover, it is not difficult to see that . Therefore, the constraints in (5) are convex in terms of Lemma 8 can be directly applied. Next, we consider the infinite impulse response (IIR) filter and are design variables. The design where both problem is fundamentally more difficult than the FIR case because the above specifications are not convex in terms of , and the simple-minded approach described previously does not yield LMI problems due to the dependence of on , the coefficients of . However, using the ideas of [9], [10], [53], and [54], the problem can be made tractable when the specifications are given by inequality constraints only. For instance, one can handle the on the gain specifications (5) if the third constraint is replaced by

This type of gain specifications can be converted to FDIs in terms of polynomials based on the Fejer–Riesz result [54, Th. 8.4.5]. Hence, the generalized KYP lemma for polynomials (Theorem 5) will be useful here. Finally, in certain applications, design objectives may be reasonably well described by gain constraints only, but the IIR filter design with both gain and phase constraints still remains as an important practical problem that needs to be addressed. B. Design examples 1) PID Controller: We consider the design of PID controllers (38) for a mechanical plant with a lightly damped flexible mode

The design objective is to have a good tracking performance with a reasonable stability margin and robustness against unmodeled dynamics. Our approach is to shape the Nyquist plot . In parof the open-loop transfer function ticular, we consider the following specifications: a) b) c) Specification a) with small ensures robustness against unmodeled dynamics which typically exists in the high frequency

range. Specification b) is meant to guarantee a certain stability margin. It may be more natural to require the Nyquist plot to be . Howoutside of a circle with its center at the point ever, such requirement leads to a nonconvex region on the complex plane and our design method cannot be applied as condition (36) is not satisfied. On the other hand, the above Spec ification b) is a half-plane requirement which can be handled within our framework. The frequency range corresponding to Specification b) would be the entire imaginary axis, but a small lower bound has been introduced to exclude from the range and to avoid a possible numerical difficulty due to the controller ensures pole at the origin. Specification c) with a large sensitivity reduction in the low frequency range by making high gain. Again, would be more direct for the purpose but does not define a convex constraint on . Specification c) is a reasonable alternative, given that the phase angle of apas approaches 0 from above. proaches We have designed a PID controller by minimizing subject to Specifications a)–c) where the other parameters are fixed as

The resulting optimal PID controller is found to be

with the optimal value . The Nyquist plot, sensitivity functions, and the step response are depicted in Figs. 3–5, respectively. Also plotted for comparison are the result of a popular heuristic PID tuning rule (Ziegler–Nichols ultimate sensitivity method) [55] where the PID parameters are given by

The stars and the (small) circles in Fig. 3 indicate the points on the Nyquist plots at frequencies and , respectively. We see from Fig. 3 that the constraints in Specifications b) and c) are active for the optimal PID control, and the optimization clearly has improved the stability margin and the sensitivity reduction in the low frequency range. The latter effect is transparent in Fig. 4, which shows that the optimization added more damping to the closed-loop poles than the Ziegler–Nichols design. This can also be seen directly from the oscillatory nature of the step responses plotted in Fig. 5. Overall, this example illustrates that our method allows for the tuning of the PID parameters to achieve desired frequency responses in prescribed frequency ranges. 2) FIR Filter: We consider the design of low-pass FIR filters to meet the following specifications: a) b) where is the filter transfer function to be designed. Spec, that the ification a) requires, within the pass-band has frequency characteristics similar to the desired filter function having unity gain and the linear phase with group

54

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 50, NO. 1, JANUARY 2005

Fig. 3. Nyquist plots. (Left) Optimized. (Right) Ziegler-Nichols.

Fig. 4.

Sensitivity functions. (Left) Optimized. (Right) Ziegler-Nichols.

Fig. 5. Step responses. (Solid) Optimized. (Dashed) Ziegler-Nichols.

delay , where is a positive integer.4 Specification b) ensures has a low gain in the stop-band . Let be that the order of the FIR filter to be designed. For fixed , , , , and , we design an FIR filter by minimizing subject to the above constraints. This is a Chebyshev approximation problem where the peak value of the frequency response is used as the design criteria. Most of the currently available methods for solving the Chebyshev approximation problem are based on the frequency 4A transfer function H (z ) is said to have a linear phase if it can be represented as H (z ) = jH (z )jz on the unit circle for some real positive scalar  . The parameter  is called the group delay.

gridding [22], [23], [25]. Some recent methods avoid frequency gridding using KYP lemmas [9], [10], [53], but these methods can deal only with a subclass of the problem where all the FDIs . No methods have been are given in terms of the gain of known, to our knowledge, to solve the FIR Chebyshev approximation problem with phase constraints as formulated above, without gridding the frequency axis. Our method naturally handles such problem. A standard approach in the literature to achieve the linear phase for FIR filters is to make the FIR coefficients symis even, it can be metric [22], [56]. For instance, when shown that the constraint on the FIR coefficients in (39) enforces the linear phase with group delay . With this symmetry constraint, Specification a) can be equivalently written as a condition on the gain only a) where we used the fact that holds on the unit with . Thus, the circle for a symmetric FIR filter design problem is exactly solvable by existing methods [9], [10] under the symmetry constraint. A drawback of this approach is that some design freedom is wasted to enforce the unnecessary constraint of the linear phase property in the stop-band. By removing the unnecessary constraint, the design freedom may be used to improve the overall

IWASAKI AND HARA: GENERALIZED KYP LEMMA

Fig. 6.

55

FIR responses. (a) n = 30, d = 15. (b) n = 30, d = 10. (c) n = 20, d = 10. (b’) n = 30, d = 10.

performance. We will illustrate this point by numerical examples in the sequel. We fix the frequency ranges and the gain bound in the stop-band as follows:

advantage of our method that allows us to treat the linear phase constraint in a finite frequency range, as opposed to the existing methods that enforce the constraint for all frequencies by symmetry of the FIR coefficients. VIII. CONCLUSION

and design several FIR filters for different choices of and . Fig. 6(a) shows the gain response of a typical symmetric FIR and , where the frequency is filter with normalized by . This filter is obtained as the optimal solution to the above Chebyshev approximation problem. The optimal . Now, consider performance value is found to be a situation where the group delay of time steps is too large and is not acceptable. Such small delay requirement would be important for certain real-time applications [25], [57]. Supis the desired group delay. Then, the standard pose that framework of symmetric FIR filters requires that the filter order . The gain response of the optimal FIR filter for be this case is shown in Fig. 6(c). The optimal performance value which is significantly larger than the previous is design, as seen by the nonflatness of the gain response in the pass-band. On the other hand, our design method is free from and, for instance, the group delay of the constraint may be (approximately) achieved with a filter of order equal to the original design, i.e., , as shown in Fig. 6(b) and (b’). We see that, within the pass-band, the linear phase requirement is practically met and the gain response is almost as good as the original . This example shows the

We have generalized the KYP lemma to allow for more flexibility in system classes and various frequency ranges. In particular, the system class is described by the null space of a certain frequency-dependent matrix, capturing descriptor systems and polynomials in addition to the standard state space systems. The frequency range is specified by two quadratic forms that define segment(s) of a circle or a line on the complex plane. In this way, an FDI condition can be considered in the low/middle/high frequency ranges for both continuous-time and discrete-time settings in a unified manner. The generalized KYP lemma converts a certain FDI in a finite frequency range to a numerically tractable LMI condition. When system design specifications are expressed in terms of such FDIs, the generalized KYP lemma may be used to reduce the problem to an LMI optimization. We have given some technical conditions under which this reduction is possible, and discussed various engineering applications that fit into the framework, including the design of digital filters, feedback controllers with fixed poles, and mechanical structures. Further developments have been reported at conferences, addressing such issues as the gain feedback design [58], controller order reduction, a robust generalized KYP lemma [59], and a GKYP design

56

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 50, NO. 1, JANUARY 2005

toolbox with MATLAB [60]. General control problems with dynamic output feedback, however, still remain open for further research. APPENDIX I SIMULTANEOUS MATRIX FACTORIZATION In this section, we prove Lemma 2. The following result is useful for this purpose. Lemma 9: Let be given. Then, admits the following factorization: (40) where , ,

and

with

In particular, and are the eigenvalues of Proof: Let the entries of be defined by

, then Lemma 9 guarantees it can holds. Let . Since and are the be decomposed as in (40) with , they can be ordered so that . eigenvalues of Then, the factorizations of and in (13) are obtained from (42) and (40) by noting the relation (41) and defining . APPENDIX II TECHNICAL LEMMAS and be given. Suppose Lemma 10: Let is an admissible, regular set. Then the following statethat ments are equivalent. such that . i) There exists , there exists such that ii) For each . ii) is obvious. To show the Proof: The implication i) converse, consider the following. such that , where iii) There exists

. We will show ii)

iii) i). Suppose iii) does not hold. Then, for all . Since is closed, is compact and, hence, there exists such that Then Let The spectral factorization of

be arbitrary and define

gives

Then, we have be verified that where the columns of are eigenvectors and and are eigenvalues. Note that , , and are all real. Moreover, since is real symmetric, can be chosen to satisfy . Thus, belongs to . Now, equating the two ex, we have pressions for

Finally, it can readily be verified that (41) holds for any . Substituting this expression into the second term, we obtain the result. Proof of Lemma 2: Since , there exists a nonsuch that singular matrix (42)

since is a cone, and it can readily . Therefore

Since is arbitrary, we see that ii) does not hold. Thus, we have shown ii) iii). Now, suppose iii) holds and let be such that . If , then and regularity implies , contradicting . So , in which case, is an element of and , proving i). satisfies Lemma 11: [19], [61] Let be a convex subset of , and be an affine function. The following statements are equivalent. i) The set is empty. ii) There exists nonzero such that is nonnegative for all . and , Lemma 12: Let matrices and sets of complex numbers and be given such that and the closure of coincides with . Denote by the null space of where is defined in (18). Suppose holds for all . Then, holds rank for all if and only if it holds for .

IWASAKI AND HARA: GENERALIZED KYP LEMMA

57

Proof: The necessity is obvious. To prove the sufficiency, such that does suppose that there exists such that not hold. Then, there exists a vector and . Let . Then, and holds for all . we have is continuous on due to the rank assumption, Note that holds if is sufficiently close to and hence . Such can be chosen from because is . Thus, we have shown that if an element of the closure of holds for all , then it must also hold for . all in (12). Suppose it repLemma 13: Consider the set is resents curves on the complex plane. Then the set and . unbounded if and only if and . Then, is Proof: Suppose a straight line, while is a half plane or the outside of a , is clearly unbounded. circle. Hence, their intersection, This proves the sufficiency. The necessity can be proved by con, then is a circle and its subset tradiction. If must be bounded. If , then for with sufficiently large magnitude and hence is bounded. must be bounded. Therefore, its subset

APPENDIX III PROOF OF LEMMA 6 Statement a) has been proven in the paragraph above the lemma. We will prove statement b). It is tedious but straightforward to verify that (22) holds if and only if

holds where is an arbitrarily fixed nonsingular matrix. Then, b) is equivalent to the statement obtained by replacing , , , and in b). Hence, without and by , loss of generality, we prove b) for the case where and where and are given by (14). Note that is regular and (21) holds if and only if

(43) We first show the “only if” part of the proof. Suppose there exists such that rank . Then there exists . Let if a nonzero vector such that or and if . It can be verified that the is a counter example to (43). This choice proves necessity. To prove the converse, suppose that the rank condition (22) is has full row rank and hence there exists a satisfied. Then nonsingular matrix that transforms into the following form:

Note that (

,

) is controllable because

Let , be such that and . We will show that ( , ) is controllable, there exists has no eigenvalues on . Then

. Since such that

If or , then it follows from [3], [45] that implies under controllability of . implies , eliminating So we will show that with the remaining possibility. For , we have

Since represents curve(s), implies . , given by In this case, the relative interior of , is nonempty. We then have for all . Since , this implies has infinitely many roots in , and hence that must vanish identically. It then follows from Theorem 1 due to controllability (statement 12) in [62, Sec. 34] that of . ACKNOWLEDGMENT The would like to thank I. Yamada and H. Hasegawa at the Tokyo Institute of Technology (TIT), Japan, for their helpful comments on filtering design, and Y. Iwatani at TIT for his help with the PID design example. They would also like to thank C. Scherer at Delft University of Technology, The Netherlands, for providing them with a preprint of [21] accompanied by his insightful comments, to K. Murota at the University of Tokyo for helpful discussions on the -procedure, and to A. L. Fradkov at the Institute for Problems of Mechanical Engineering for directing their attention to the connection between [6] and [17], which has lead to the necessity proof of Theorem 1. REFERENCES [1] B. D. O. Anderson, “A system theory criterion for positive real matrices,” SIAM J. Control, vol. 5, pp. 171–182, 1967. [2] J. C. Willems, “Least squares stationary optimal control and the algebraic Riccati equation,” IEEE Trans. Autom. Control, vol. AC-16, no. 6, pp. 621–634, Dec. 1971. [3] A. Rantzer, “On the Kalman-Yakubovich-Popov lemma,” Syst. Control Lett., vol. 28, no. 1, pp. 7–10, 1996. [4] S. Boyarski and U. Shaked, Discrete-Time and Control of systems with structured disturbances under probability requirements, 2003.

H

H

58

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 50, NO. 1, JANUARY 2005

H

[5] C. Scherer, “Robust -controller design under structured noise-uncertainty,” presented at the IFAC World Congr., Barcelona, Spain, 2002. [6] (2000) Mathematical Problems in Engineering [Online]. Available: http://mpe.hindawi.com/volume-6/S1 024 123X00 001 368.html [7] T. Iwasaki, S. Hara, and H. Yamauchi, “Dynamical system design from a control perspective: finite frequency positive-realness approach,” IEEE Trans. Autom. Control, vol. 48, no. 8, pp. 1337–1354, Aug. 2003. [8] Y. Nesterov, “Squared functional systems and optimization problems,” in High Performance Optimization, H. Frenk et al., Eds. Norwell, MA: Kluwer, 2000, pp. 405–440. [9] Y. Genin, Y. Hachez, Y. Nesterov, and P. Van Dooren, “Convex optimization over positive polynomials and filter design,” in Proc. UKACC Int. Conf. Control, 2000. [10] T. N. Davidson, Z.-Q. Luo, and J. F. Sturm, “Linear matrix inequality formulation of spectral mask constraints with applications to FIR filter design,” IEEE Trans. Signal Processing, vol. 50, no. 11, pp. 2702–2715, Nov. 2002. [11] F. P. Gantmacher and V. A. Yakubovich, “Absolute stability of nonlinear control systems,” in Proc. 2nd All-Union Session on Theoretical and Applied Mechanics, Moscow, Russia, 1966. [12] V. A. Yakuboviˇc, S-Procedure in Nonlinear Control Theory. Leningrad, Russia: Vestnik Leningrad Univ., 1971, vol. 1, pp. 62–77. [13] Linear Matrix Inequalities in System and Control Theory, 1994. [14] T. Iwasaki and S. Hara, “Well-posedness of feedback systems: Insights into exact robustness analysis and approximate computations,” IEEE Trans. Autom. Control, vol. 43, no. 5, pp. 619–630, May 1998. [15] A. Megretski and A. Rantzer, “System analysis via integral quadratic constraints,” IEEE Trans. Autom. Control, vol. 42, no. 6, pp. 819–830, Jun. 1997. [16] C. W. Scherer, “LPV control with full block multipliers,” Automatica, vol. 37, no. 3, pp. 361–375, 2001. [17] A. L. Fradkov, “Duality theorems for certain nonconvex extremal problems,” Sibirskii Matematicheskii Zhurnal, vol. 14, no. 2, pp. 357–383, Mar.–Apr. 1973. [18] A. Packard and J. Doyle, “The complex structured singular value,” Automatica, vol. 29, no. 1, pp. 71–109, 1993. [19] G. Meinsma, Y. Shrivastava, and M. Fu, “A dual formulation of mixed  and on the losslessness of (D; G) scaling,” IEEE Trans. Autom. Contr., vol. 42, no. 7, pp. 1032–1036, Jul. 1997. [20] D. Henrion and G. Meinsma, “Rank-one LMIs and Lyapunov’s inequality,” IEEE Trans. Autom. Control, vol. 46, no. 8, pp. 1285–1288, Aug. 2001. [21] C. Scherer, “When are multiplier relaxations exact?,” in Proc. IFAC Symp. Robust Control, 2003. [22] T. W. Parks and J. H. McClellan, “Chebyshev approximation for nonrecursive digital filters with linear phase,” IEEE Trans. Circuit Theory, vol. CT-19, no. 2, pp. 189–194, Feb. 1972. [23] X. Chen and T. W. Parks, “Design of FIR filters in the complex domain,” IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-35, no. 2, pp. 144–153, Feb. 1987. [24] K. Preuss, “On the design of FIR filters by complex Chebyshev approximation,” IEEE Trans. Acoust., Speech, Signal Process., vol. 37, no. 5, pp. 702–712, May 1989. [25] L. J. Karam and J. H. McClellan, “Chebyshev digital FIR filter design,” Signal Process., vol. 76, pp. 17–36, 1999. [26] K. Zhou, J. Doyle, and K. Glover, Robust and Optimal Control. Upper Saddle River, NJ: Prentice-Hall, 1996. [27] T. Glad and L. Ljung, Control Theory—Multivariable and Nonlinear Methods. New York: Taylor Francis, 2000. [28] A. Messac and K. Malec, “Control-structure integrated design,” AIAA J., vol. 30, no. 8, pp. 2124–2131, 1992. [29] P. Maghami, S. Joshi, and D. Price, “Integrated controls-structures design methodology for flexible space craft,” J. Spacecrafts Rockets, vol. 32, no. 5, pp. 839–844, 1995. [30] K. M. Grigoriadis, G. Zhu, and R. E. Skelton, “Optimal redesign of linear systems,” ASME J. Dyna. Syst. Meas. Control, vol. 118, pp. 596–605, 1996. [31] S. Hara, T. Iwasaki, and F. Shimizu, “Finite frequency characterization of easily controllable mechanical systems under control effort constraint,” in Proc. IFAC World Congr., 2002. [32] V. A. Yakuboviˇc, “Minimization of quadratic functionals under quadratic constraints and the necessity of a frequency condition in the quadratic criterion for absolute stability of nonlinear control systems,” Soviet Math. Dokl., vol. 14, no. 2, pp. 593–597, 1973. [33] A. L. Fradkov and V. A. Jakubovic, The S-Procedure and a Duality Relations in Nonconvex Problems of Quadratic Programming. Leningrad, Russia: Vestnik Leningrad Univ. Math., 1979, vol. 6, pp. 101–109.

[34] R. E. Skelton, T. Iwasaki, and K. M. Grigoriadis, A Unified Algebraic Approach to Linear Control Design. New York: Taylor Francis, 1997. [35] T. Iwasaki, LMI and Control (in Japanese). Tokyo, Japan: Shokodo, 1997. [36] M. Fan, A. Tits, and J. Doyle, “Robustness in the presence of mixed parametric uncertainty and unmodeled dynamics,” IEEE Trans. Autom. Control, vol. 36, no. 1, pp. 25–38, Jan. 1991. [37] J. C. Doyle, “Analysis of feedback systems with structured uncertainties,” Proc. Inst. Elect. Eng. D, vol. 129, no. 6, pp. 242–250, Nov. 1982. [38] R. T. Rockafellar, Convex Analysis. Princeton, NJ: Princeton Univ. Press, 1970. [39] Y. Ebihara, K. Maeda, and T. Hagiwara, “Generalized s-procedure for inequality conditions on one-vector-lossless sets and linear system analysis,” in Proc. IEEE Conf. Decision Control, 2004. [40] D. Henrion, M. Sebek, and V. Kucera, “Positive polynomials and robust stabilization with fixed-order controllers,” IEEE Trans. Autom. Control, vol. 48, no. 7, pp. 1178–1186, Jul. 2003. [41] T. Iwasaki and S. Hara, “Generalization of Kalman-Yakubovic-Popov lemma for restricted frequency inequalities,” in Proc. Amer. Control Conf., 2003. [42] X. Zhang, P. Tsiotras, and T. Iwasaki, “Parameter-dependent Lyapunov functions for stability analysis of LTI parameter dependent systems,” in Proc. IEEE Conf. Decision Control, 2003. [43] P.-A. Bliman, “A convex approach to robust stability for linear systems with uncertain scalar parameters,” SIAM J. Control Optim., vol. 42, no. 6, pp. 2016–2042, 2004. [44] T. Iwasaki and G. Shibata, “LPV system analysis via quadratic separator for uncertain implicit systems,” IEEE Trans. Autom. Control, vol. 46, no. 8, pp. 1195–1208, Aug. 2001. [45] V. Balakrishnan and L. Vandenberghe, “Semidefinite programming duality and linear time-invariant systems,” IEEE Trans. Autom. Control, vol. 48, no. 1, pp. 30–41, Jan. 2003. [46] A. N. Churilov, On the solvability of some matrix inequalities. Leningrad, Russia: Vestnik Leningrad State Univ., 1980, vol. 2, pp. 51–55. [47] Y. Genin, Y. Nesterov, and P. Van Dooren, “Positive transfer functions and convex optimization,” in Proc. Euro. Control Conf., 1999. [48] Y. Genin, Y. Nesterov, R. Stefan, P. Van Dooren, and S. Xu, “Positivity and linear matrix inequalities,” Eur. J. Control, vol. 8, no. 3, pp. 275–298, 2002. [49] Y. Genin, Y. Hachez, Y. Nesterov, and P. Van Dooren, “Optimization problems over positive pseudopolynomial matrices,” SIAM J. Matrix Anal. Appl., vol. 25, no. 1, pp. 57–79, 2003. [50] T. Iwasaki and S. Hara. (2003, Aug.) Generalized KYP lemma: Unified characterization of frequency domain inequalities with applications to system design. Univ. Tokyo, Tokyo, Japan. [Online]. Available: http://www.keisu.t.u-tokyo.ac.jp/Research/techrep. 0.html [51] D. C. Youla, H. A. Jabr, and J. J. Bongiorno, “Modern Wiener–Hopf design of optimal controllers: Part 2,” IEEE Trans. Autom. Control, vol. AC-21, no. 3, pp. 319–338, Jun. 1976. [52] S. Boyd and C. H. Barratt, Linear Controller Design: Limits of Performance. Upper Saddle River, NJ: Prentice-Hall, 1991. [53] J. Tuqan and P. P. Vaidyanathan, “A state–space approach to the design of globally optimal FIR energy compaction filters,” IEEE Trans. Signal Process., vol. 48, no. 10, pp. 2822–2838, Oct. 2000. [54] W. Rudin, Fourier Analysis on Groups. New York: Wiley, 1962. [55] K. J. Astrom and B. Wittenmark, Computer-Controlled Systems—Theory and Design. New York: Prentice-Hall, 1997. [56] L. R. Rabiner, “Techniques for designing finite-duration impulse-response digital filters,” IEEE Trans. Commun. Technol., vol. CT-19, no. 2, pp. 188–195, Feb. 1971. [57] X. Zhang and T. Yoshikawa, “Design of FIR Nyquist filters with low group delay,” IEEE Trans. Signal Process., vol. 47, no. 5, pp. 1454–1458, May 1999. [58] T. Iwasaki and S. Hara, “Robust control synthesis with general frequency domain specifications: Static gain feedback case,” in Proc. Amer. Control Conf., 2004. [59] S. Hara, D. Shiokata, and T. Iwasaki, “Fixed order controller design via generalized KYP lemma,” in Proc. IEEE Conf. Control Applications, 2004. [60] D. Shiokata, S. Hara, and T. Iwasaki, “From Nyquist/Bode to GKYP design: design algorithms and cacsd tools,” in Proc. SICE Annu. Conf., 2004. [61] D. G. Luenberger, Optimization by Vector Space Methods. New York: Wiley, 1968. [62] V. M. Popov, Hyperstability of Control Systems. New York: SpringerVerlag, 1973.

IWASAKI AND HARA: GENERALIZED KYP LEMMA

Tetsuya Iwasaki (S’89–M’94–SM’01) received the B.E. and M.E. degrees in electrical and electronic engineering from the Tokyo Institute of Technology, Tokyo, Japan, in 1987 and 1990, respectively, and the Ph.D. degree from the School of Aeronautics and Astronautics, Purdue University, West Lafayette, IN, in 1993. He held a postdoctoral position at Purdue University. After holding a faculty position for five years at the Tokyo Institute of Technology, he moved to the University of Virginia, Charlottesville, in May 2000, where he is currently a Professor. His research interests include robust and optimal control, and modeling and control of biological sensing, locomotion, and oscillation. Dr. Iwasaki received the 2002 Pioneer Prize from the Society of Instrument and Control Engineers, and the 2003 National Science Foundation CAREER Award. He is a past Associate Editor of the IEEE TRANSACTIONS ON AUTOMATIC CONTROL, and currently is on the Editorial Boards of Automatica and Systems and Control Letters.

59

Shinji Hara (M’87–SM’04) was born in Izumo, Japan, in 1952. He received the B.S., M.S., and Ph.D. degrees in engineering from the Tokyo Institute of Technology, Tokyo, Japan, in 1974, 1976, and 1981, respectively. From 1976 to 1980, he was a Research Member of Nippon Telegraph and Telephone Public Corporation, Japan. He served as a Research Associate at the Technological University of Nagaoka, Japan, from 1980 to 1984. In 1984, he joined the Tokyo Institute of Technology as an Associate Professor and has served as a Full Professor for ten years. Since 2001, he has been a Full Professor in the Department of Information Physics and Computing, The University of Tokyo. His current research interests are in robust control, sampled-data control, hybrid control, learning control, quantum control, and computational aspects of control system design. He is a Member of SICE and ISCIE. Dr. Hara was a BoG member of the IEEE Control Systems Society, the General Chair of the CCA04, and an Associate Editor of the IEEE TRANSACTIONS ON AUTOMATIC CONTROL and Automatica. He received Best Paper Awards from SICE (the Society of Instrument and Control Engineers, Japan) in 1987, 1991, 1992, 1997, and 1998, from the Japan Society for Simulation Technology in 2001, and from ISCIE in 2002.