Sublinear function

In linear algebra, a sublinear function (or functional, as is more common in functional analysis), also called a quasi-seminorm, on a vector space is a real-valued function with only some of the properties of a seminorm. Unlike a seminorm, a sublinear function does not have to be nonnegative-valued and does not have to be absolutely homogeneous. Seminorms are themselves abstractions of the better-known notion of norms: a seminorm has all the defining properties of a norm except that it is not required to map non-zero vectors to non-zero values.

In functional analysis the name Banach functional is sometimes used, reflecting that sublinear functions are most commonly used when applying a general formulation of the Hahn–Banach theorem. The notion of a sublinear function was introduced by Stefan Banach when he proved the Hahn–Banach theorem.[1]

There is also a different notion in computer science, described below, that also goes by the name "sublinear function."

Definitions

Let X be a vector space over a field 𝕂, where 𝕂 is either the real numbers ℝ or the complex numbers ℂ. A function p:X→ℝ is called sublinear if it has these two properties:[1]

  1. Positive homogeneity,[2] that is, p(rx)=rp(x) for all r≥0 and x∈X.
  2. Subadditivity,[2] that is, p(x+y)≤p(x)+p(y) for all x,y∈X.
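
The two defining properties can be spot-checked numerically. The following is a minimal sketch for functions on the real line; the helper name `is_sublinear_on_samples`, the sample points, and the tolerance are illustrative assumptions. A finite check of this kind can refute sublinearity but can never prove it.

```python
import itertools

def is_sublinear_on_samples(p, points, scalars, tol=1e-9):
    """Check positive homogeneity and subadditivity of p on finite samples only."""
    # Positive homogeneity: p(r*x) == r*p(x) for every sampled r >= 0.
    homogeneous = all(
        abs(p(r * x) - r * p(x)) <= tol for r in scalars for x in points
    )
    # Subadditivity: p(x + y) <= p(x) + p(y) for every sampled pair.
    subadditive = all(
        p(x + y) <= p(x) + p(y) + tol
        for x, y in itertools.product(points, repeat=2)
    )
    return homogeneous and subadditive

points = [-2.0, -0.5, 0.0, 1.0, 3.0]
scalars = [0.0, 0.5, 2.0]  # nonnegative scalars only

abs_ok = is_sublinear_on_samples(abs, points, scalars)
square_ok = is_sublinear_on_samples(lambda x: x * x, points, scalars)
print(abs_ok, square_ok)
```

On these samples the absolute value passes both checks, while x↦x² already fails positive homogeneity.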

A function p:X→ℝ is called positive[3] or nonnegative if p(x)≥0 for all x∈X, although some authors[4] define positive to instead mean that p(x)≠0 whenever x≠0; these definitions are not equivalent. It is a symmetric function if p(−x)=p(x) for all x∈X. Every subadditive symmetric function is necessarily nonnegative.[proof 1] A sublinear function on a real vector space is symmetric if and only if it is a seminorm. A sublinear function on a real or complex vector space is a seminorm if and only if it is a balanced function or equivalently, if and only if p(ux)≤p(x) for every unit length scalar u and x∈X.

The set of all sublinear functions on X, denoted by X^#, can be partially ordered by declaring p≤q if and only if p(x)≤q(x) for all x∈X. A sublinear function is called minimal if it is a minimal element of X^# under this order. A sublinear function is minimal if and only if it is a real linear functional.[1]

Examples and sufficient conditions

Every norm, seminorm, and real linear functional is a sublinear function. The identity function on ℝ is an example of a sublinear function (in fact, it is even a linear functional) that is neither positive nor a seminorm; the same is true of this map's negation x↦−x.[5] More generally, for any real a≤b, the map

S_{a,b}:ℝ→ℝ,   x ↦ ax if x≤0,   x ↦ bx if x≥0

is a sublinear function on ℝ and moreover, every sublinear function p:ℝ→ℝ is of this form; specifically, if a=−p(−1) and b=p(1) then a≤b and p=S_{a,b}.
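
The classification of sublinear functions on ℝ can be illustrated with a short sketch. The names `make_S` and `coefficients` and the sample function p are hypothetical, chosen only for this example; integer inputs are used so that the comparison is exact.

```python
def make_S(a, b):
    """Return the map x -> a*x for x <= 0 and b*x for x >= 0 (requires a <= b)."""
    if a > b:
        raise ValueError("the two-slope map is sublinear only when a <= b")
    return lambda x: a * x if x <= 0 else b * x

def coefficients(p):
    """Recover (a, b) = (-p(-1), p(1)) from a sublinear function p on the real line."""
    return -p(-1), p(1)

# Hypothetical example: the pointwise maximum of two linear maps on the real line.
p = lambda x: max(2 * x, -3 * x)

a, b = coefficients(p)   # a = -p(-1) = -3, b = p(1) = 2
S = make_S(a, b)
agrees = all(S(x) == p(x) for x in range(-5, 6))
print(a, b, agrees)
```

Here the recovered slopes satisfy a≤b, and the rebuilt two-slope map agrees with p at every sampled integer.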

If p and q are sublinear functions on a real vector space X then so is the map x↦max{p(x),q(x)}. More generally, if 𝒫 is any non-empty collection of sublinear functionals on a real vector space X and if for all x∈X, q(x)=sup{p(x):p∈𝒫}, then q is a sublinear functional on X.[5]
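
As a hedged illustration of the supremum construction, the sketch below takes the pointwise maximum of three linear functionals on ℝ² and checks both defining properties on an integer grid. The coefficient vectors and helper names are assumptions made for this example, and the grid check is evidence only, not a proof.

```python
from itertools import product

def dot(c, x):
    """Evaluate the linear functional with coefficient vector c at x."""
    return sum(ci * xi for ci, xi in zip(c, x))

def pointwise_max(functionals):
    """Return q(x) = max of f(x) over a finite, non-empty family of functionals."""
    return lambda x: max(f(x) for f in functionals)

# Hypothetical family of linear (hence sublinear) functionals on R^2.
family = [lambda x, c=c: dot(c, x) for c in [(1, 0), (0, 1), (-1, -1)]]
q = pointwise_max(family)

grid = list(product(range(-3, 4), repeat=2))
subadditive = all(
    q((x[0] + y[0], x[1] + y[1])) <= q(x) + q(y) for x in grid for y in grid
)
homogeneous = all(
    q((r * x[0], r * x[1])) == r * q(x) for r in range(0, 4) for x in grid
)
# q is not linear: q(x) + q(-x) is strictly positive at x = (1, 0).
gap = q((1, 0)) + q((-1, 0))
print(subadditive, homogeneous, gap)
```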


A function p:X→ℝ which is subadditive, convex, and satisfies p(0)≤0 is also positively homogeneous (the condition p(0)≤0 is necessary, as the example p(x)=√(x²+1) on X=ℝ shows). If p is positively homogeneous, then it is convex if and only if it is subadditive. Therefore, assuming p(0)≤0, any two properties among subadditivity, convexity, and positive homogeneity imply the third.

Properties

Every sublinear function is a convex function: for 0≤t≤1 and x,y∈X,

  p(tx+(1−t)y) ≤ p(tx)+p((1−t)y)   (subadditivity)
               = tp(x)+(1−t)p(y)   (nonnegative homogeneity).

If p:X→ℝ is a sublinear function on a vector space X then[proof 2][3] p(0)=0≤p(x)+p(−x) for every x∈X, which implies that at least one of p(x) and p(−x) must be nonnegative; that is, for every x∈X,[3] 0≤max{p(x),p(−x)}. Moreover, when p:X→ℝ is a sublinear function on a real vector space then the map q:X→ℝ defined by q(x):=max{p(x),p(−x)} is a seminorm.[3]

Subadditivity of p:X→ℝ guarantees that for all vectors x,y∈X,[1][proof 3] p(x)−p(y)≤p(x−y) and −p(x)≤p(−x), so if p is also symmetric then the reverse triangle inequality holds for all vectors x,y∈X: |p(x)−p(y)|≤p(x−y).

Defining ker p:=p⁻¹(0), subadditivity also guarantees that for all x∈X, the value of p on the set x+(ker p∩−ker p)={x+k:p(k)=0=p(−k)} is constant and equal to p(x).[proof 4] In particular, if ker p=p⁻¹(0) is a vector subspace of X then −ker p=ker p and the assignment x+ker p↦p(x), which will be denoted by p̂, is a well-defined real-valued sublinear function on the quotient space X/ker p that satisfies p̂⁻¹(0)=ker p. If p is a seminorm then p̂ is just the usual canonical norm on the quotient space X/ker p.

Pryce's sublinearity lemma[2]: Suppose p:X→ℝ is a sublinear functional on a vector space X and that K⊆X is a non-empty convex subset. If x∈X is a vector and a,c>0 are positive real numbers such that p(x)+ac<inf_{k∈K} p(x+ak), then for every positive real b>0 there exists some 𝐳∈K such that p(x+a𝐳)+bc<inf_{k∈K} p(x+a𝐳+bk).

Adding bc to both sides of the hypothesis p(x)+ac<inf p(x+aK) (where p(x+aK):={p(x+ak):k∈K}) and combining that with the conclusion gives p(x)+ac+bc<inf p(x+aK)+bc≤p(x+a𝐳)+bc<inf p(x+a𝐳+bK), which yields many more inequalities, including, for instance, p(x)+ac+bc<p(x+a𝐳)+bc<p(x+a𝐳+b𝐳), in which an expression on one side of a strict inequality < can be obtained from the other by replacing the symbol c with 𝐳 (or vice versa) and moving the closing parenthesis to the right (or left) of an adjacent summand (all other symbols remain fixed and unchanged).

Associated seminorm

If p:X→ℝ is a real-valued sublinear function on a real vector space X (or if X is complex, then when it is considered as a real vector space) then the map q(x):=max{p(x),p(−x)} defines a seminorm on the real vector space X called the seminorm associated with p.[3] A sublinear function p on a real or complex vector space is a symmetric function if and only if p=q where q(x):=max{p(x),p(−x)} as before.
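
A small sketch of the associated seminorm, under the assumption that p is the coordinate maximum on ℝ² (an example chosen here, not taken from the cited sources). This p is sublinear but neither symmetric nor nonnegative, and its associated seminorm turns out to be the sup norm on the sampled grid.

```python
from itertools import product

def p(x):
    """Coordinate maximum on R^2: sublinear, but not symmetric and not nonnegative."""
    return max(x)

def associated_seminorm(p):
    """Return q(x) = max(p(x), p(-x)) for vectors represented as tuples."""
    def q(x):
        negated = tuple(-xi for xi in x)
        return max(p(x), p(negated))
    return q

q = associated_seminorm(p)
grid = list(product(range(-3, 4), repeat=2))

# On this grid q coincides with the sup norm max(|x1|, |x2|).
is_sup_norm = all(q(x) == max(abs(xi) for xi in x) for x in grid)
# p itself is not symmetric, so p differs from q.
p_is_symmetric = all(p(x) == p(tuple(-xi for xi in x)) for x in grid)
print(is_sup_norm, p_is_symmetric, p((-1, -2)))
```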

More generally, if p:X→ℝ is a real-valued sublinear function on a (real or complex) vector space X then q(x):=sup_{|u|=1} p(ux)=sup{p(ux):u is a unit scalar} will define a seminorm on X if this supremum is always a real number (that is, never equal to ∞).

Relation to linear functionals

If p is a sublinear function on a real vector space X then the following are equivalent:[1]

  1. p is a linear functional.
  2. for every x∈X, p(x)+p(−x)≤0.
  3. for every x∈X, p(x)+p(−x)=0.
  4. p is a minimal sublinear function.

If p is a sublinear function on a real vector space X then there exists a linear functional f on X such that f≤p.[1]

If X is a real vector space, f is a linear functional on X, and p is a positive sublinear function on X, then f≤p on X if and only if f⁻¹(1)∩{x∈X:p(x)<1}=∅.[1]

Dominating a linear functional

A real-valued function f defined on a subset of a real or complex vector space X is said to be dominated by a sublinear function p if f(x)≤p(x) for every x that belongs to the domain of f. If f:X→ℝ is a real linear functional on X then[6][1] f is dominated by p (that is, f≤p) if and only if −p(−x)≤f(x)≤p(x) for every x∈X. Moreover, if p is a seminorm or some other symmetric map (which by definition means that p(−x)=p(x) holds for all x) then f≤p if and only if |f|≤p.
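
The domination criterion can be explored on ℝ, where every linear functional has the form f(x)=cx. The sketch below assumes a particular two-slope sublinear function p and scans integer slopes c; the helper names and sample ranges are illustrative. Both tests select the same slopes, namely those with −p(−1)≤c≤p(1).

```python
def p(x):
    """Hypothetical sublinear function on the real line with slopes -3 and 2."""
    return 2 * x if x >= 0 else -3 * x

def is_dominated(f, p, samples):
    """Check f(x) <= p(x) on the samples."""
    return all(f(x) <= p(x) for x in samples)

def satisfies_two_sided_bound(f, p, samples):
    """Check -p(-x) <= f(x) <= p(x) on the samples."""
    return all(-p(-x) <= f(x) <= p(x) for x in samples)

samples = range(-4, 5)
slopes = range(-5, 6)

dominated = [
    c for c in slopes if is_dominated(lambda x, c=c: c * x, p, samples)
]
two_sided = [
    c for c in slopes
    if satisfies_two_sided_bound(lambda x, c=c: c * x, p, samples)
]
print(dominated, two_sided)
```

The extreme slopes c=2 and c=−3 give dominated linear functionals that touch p at positive and negative points respectively.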

Theorem[1]: If p:X→ℝ is a sublinear function on a real vector space X and if z∈X, then there exists a linear functional f on X that is dominated by p (that is, f≤p) and satisfies f(z)=p(z). Moreover, if X is a topological vector space and p is continuous at the origin then f is continuous.

Continuity

Theorem[7]: Suppose f:X→ℝ is a subadditive function (that is, f(x+y)≤f(x)+f(y) for all x,y∈X). Then f is continuous at the origin if and only if f is uniformly continuous on X. If f satisfies f(0)=0 then f is continuous if and only if its absolute value |f|:X→[0,∞) is continuous. If f is non-negative then f is continuous if and only if {x∈X:f(x)<1} is open in X.

Suppose X is a topological vector space (TVS) over the real or complex numbers and p is a sublinear function on X. Then the following are equivalent:[7]

  1. p is continuous;
  2. p is continuous at 0;
  3. p is uniformly continuous on X;

and if p is positive then this list may be extended to include:

  4. {x∈X:p(x)<1} is open in X.

If X is a real TVS, f is a linear functional on X, and p is a continuous sublinear function on X, then f≤p on X implies that f is continuous.[7]

Relation to Minkowski functions and open convex sets

Theorem[7]: If U is a convex open neighborhood of the origin in a topological vector space X then the Minkowski functional of U, p_U:X→[0,∞), is a continuous non-negative sublinear function on X such that U={x∈X:p_U(x)<1}; if in addition U is a balanced set then p_U is a seminorm on X.
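
For a concrete feel, the Minkowski functional of an open interval in ℝ can be approximated numerically. The sketch below assumes U=(−1,2) and uses bisection on the scaling factor t in p_U(x)=inf{t>0 : x∈tU}; the bisection approach and the names `gauge` and `in_U` are choices made for this illustration and rely on U being convex and containing the origin.

```python
def in_U(x):
    """Membership test for the assumed set U = (-1, 2), a convex open neighborhood of 0."""
    return -1.0 < x < 2.0

def gauge(contains, x, iterations=80):
    """Approximate inf{t > 0 : x/t in U} by bisection (U convex, containing 0)."""
    hi = 1.0
    while not contains(x / hi):   # grow hi until x/hi lies in U
        hi *= 2.0
    lo = 0.0
    for _ in range(iterations):
        mid = (lo + hi) / 2.0
        if contains(x / mid):
            hi = mid              # x still lies in mid*U, so shrink from above
        else:
            lo = mid
    return hi

values = {x: gauge(in_U, x) for x in (3.0, 1.0, -2.0, -0.5)}
print(values)
```

Up to rounding, the computed values match the two-slope map x↦x/2 for x≥0 and x↦−x for x≤0, and the points with value below 1 are exactly those in U. Since U is not balanced, this gauge is not a seminorm.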

Relation to open convex sets

Theorem[7]: Suppose that X is a topological vector space (not necessarily locally convex or Hausdorff) over the real or complex numbers. Then the open convex subsets of X are exactly those that are of the form z+{x∈X:p(x)<1}={x∈X:p(x−z)<1} for some z∈X and some positive continuous sublinear function p on X.

Proof

Let V be an open convex subset of X. If 0∈V then let z:=0 and otherwise let z∈V be arbitrary. Let p:X→[0,∞) be the Minkowski functional of V−z, which is a continuous sublinear function on X since V−z is convex, absorbing, and open (p however is not necessarily a seminorm since V was not assumed to be balanced). From X=X−z, it follows that z+{x∈X:p(x)<1}={x∈X:p(x−z)<1}. It will be shown that V=z+{x∈X:p(x)<1}, which will complete the proof. One of the known properties of Minkowski functionals guarantees {x∈X:p(x)<1}=(0,1)(V−z), where (0,1)(V−z):={tx:0<t<1,x∈V−z}=V−z since V−z is convex and contains the origin. Thus V−z={x∈X:p(x)<1}, as desired. ◼

Operators

The concept can be extended to operators that are homogeneous and subadditive. This requires only that the codomain be, say, an ordered vector space to make sense of the conditions.

Computer science definition

In computer science, a function f:ℤ⁺→ℝ is called sublinear if lim_{n→∞} f(n)/n=0, or f(n)∈o(n) in asymptotic notation (notice the small o). Formally, f(n)∈o(n) if and only if, for any given c>0, there exists an N such that f(n)<cn for n≥N.[8] That is, f grows slower than any linear function. The two meanings should not be confused: while a Banach functional is convex, almost the opposite is true for functions of sublinear growth: every function f(n)∈o(n) can be upper-bounded by a concave function of sublinear growth.[9]
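
The growth condition can be illustrated numerically. In the sketch below, the helper names `ratios` and `threshold` and the constant c=0.25 are assumptions made for the example; the threshold search inspects only finitely many n, so it suggests rather than proves membership in o(n).

```python
import math

def ratios(f, ns):
    """Return f(n)/n for each n; for sublinear growth these tend to 0."""
    return [f(n) / n for n in ns]

def threshold(f, c, limit):
    """Smallest N with f(n) < c*n for every N <= n <= limit (a finite check only)."""
    failures = [n for n in range(1, limit + 1) if not f(n) < c * n]
    return max(failures) + 1 if failures else 1

ns = [10**2, 10**4, 10**6]
sqrt_ratios = ratios(math.sqrt, ns)            # shrink toward 0
linear_ratios = ratios(lambda n: 3 * n, ns)    # stay at the constant 3

# sqrt(n) < n/4 holds exactly when n > 16, so the search returns 17.
N = threshold(math.sqrt, 0.25, 1000)
print(sqrt_ratios, linear_ratios, N)
```

The square root has ratios that shrink by a factor of ten at each step, whereas a linear function keeps a constant ratio and so is not of sublinear growth.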

See also

  • Asymmetric norm – Generalization of the concept of a norm
  • Hahn–Banach theorem – Theorem on extension of bounded linear functionals
  • Linear functional – Linear map from a vector space to its field of scalars
  • Minkowski functional – Function made from a set
  • Norm (mathematics) – Length in a vector space
  • Seminorm – Mathematical function
  • Superadditivity – Property of a function

Notes

Proofs

  1. ^ Let x∈X. The triangle inequality and symmetry imply p(0)=p(x+(−x))≤p(x)+p(−x)=p(x)+p(x)=2p(x). Substituting 0 for x and then subtracting p(0) from both sides proves that 0≤p(0). Thus 0≤p(0)≤2p(x), which implies 0≤p(x). ◼
  2. ^ If x∈X and r:=0 then nonnegative homogeneity implies that p(0)=p(rx)=rp(x)=0p(x)=0. Consequently, 0=p(0)=p(x+(−x))≤p(x)+p(−x), which is only possible if 0≤max{p(x),p(−x)}. ◼
  3. ^ p(x)=p(y+(x−y))≤p(y)+p(x−y), which happens if and only if p(x)−p(y)≤p(x−y). Substituting y:=−x gives p(x)−p(−x)≤p(x−(−x))=p(x+x)≤p(x)+p(x), which implies −p(−x)≤p(x) (positive homogeneity is not needed; the triangle inequality suffices). ◼
  4. ^ Let x∈X and k∈p⁻¹(0)∩(−p⁻¹(0)). It remains to show that p(x+k)=p(x). The triangle inequality implies p(x+k)≤p(x)+p(k)=p(x)+0=p(x). Since p(−k)=0, p(x)=p(x)−p(−k)≤p(x−(−k))=p(x+k), as desired. ◼

References

  1. ^ a b c d e f g h i Narici & Beckenstein 2011, pp. 177–220.
  2. ^ a b c Schechter 1996, pp. 313–315.
  3. ^ a b c d e Narici & Beckenstein 2011, pp. 120–121.
  4. ^ Kubrusly 2011, p. 200.
  5. ^ a b Narici & Beckenstein 2011, pp. 177–221.
  6. ^ Rudin 1991, pp. 56–62.
  7. ^ a b c d e Narici & Beckenstein 2011, pp. 192–193.
