How to Study Boundary Phenomena

English Reciprocals and the Limits of Categorization

Brett Reynolds

Humber Polytechnic & University of Toronto

2026-04-10

What makes a category real?

  • Traditional answer: necessary and sufficient conditions
  • Problem: linguistic categories resist definition. Pronouns share properties, but no single property is shared by all and only pronouns.
  • Alternative: a robust cluster of co-occurring properties + stabilizing mechanisms that maintain the cluster = a real category

Homeostatic = self-correcting, like body temperature: the system drifts, mechanisms push it back. A category maintained this way is projectible – you can make predictions about new instances.

No real kind without a purpose

Syntactician’s proper noun

  • Distribution: Brett left / *The Brett left
  • Agreement: 3rd person singular
  • Modification: restricted
  • Typically has proper name semantics

Semanticist’s proper name

  • Rigid designation
  • Referential opacity
  • Sense vs. reference
  • Usually instantiated by a proper noun

Brett is both. Different projections, different mechanisms, different HPCs, same extension.

Homeostasis: the virtuous circle

What holds these clusters together?

Property cluster
co-occurring properties

stabilizes →

← sustains

Mechanisms
causal processes

Five grammatical mechanisms: morphological realization rules · agreement/case systems · entrenched distributional patterns · grammaticalization pathways · community norms


Mechanisms maintain clusters. Clusters maintain mechanisms. That’s what homeostatic means. (A reciprocal relationship, as it happens.)

Stability is dynamic, not static

Grammatical categories are spinning tops, not balls in valleys.

The reciprocals puzzle

Our question is syntactic: what lexical category do each other and one another belong to? Pronoun or compound determinative?

Determinative: the word class of the, some, every — not the determiner function. Compound determinatives like somebody (some + body) and everyone (every + one) share the compound structure of each other (each + other).

Pronoun-like Determinative-like
Morphology (66) Compound; not monomorphemic; no distinct accusative, genitive, or reflexive forms
Semantics (36) Pro-form (stands in for a full NP); definite; anaphoric; requires an antecedent
Syntax (50) Not in partitives (*each other of the people); not in existentials; doesn’t take else Accepts genitive ’s; appears in object
Phonology (3) (weak signal)

155 properties. Morphology pulls one way, semantics the other, syntax is mixed. Which way do they go?

The problem with cherry-picking

Two items, 155 tests, and a strong temptation to cherry-pick.

Croft (2001) calls this methodological opportunism: consciously or not, we select tests that support our preferred analysis.

The alternative: measure the stability of diagnostic ambiguity. Vary every reasonable analytic choice and ask whether the answer changes.

The interesting question isn’t “which category?” but “how stable is the apparent boundary position under different measurement choices?”

What HPC predicts for boundary items

  • Stable position: the result doesn’t depend on how you measure
  • Cross-dimensional tension: morphology and semantics pull in different directions
  • Clean anchors: clear cases come out right, so the method is trustworthy
  • Near-parity mixture: the item sits right at the midpoint between the two categories
  • Robustness to null: scramble the data keeping its basic structure; the pattern shouldn’t appear by chance — and it doesn’t

These aren’t arbitrary desiderata. They’re consequences of the theory.

Mapping grammatical space

155 binary properties (morphology, syntax, semantics, phonology) across 138 items. This 2D projection captures ~17% of the variance; all actual measurement uses full 155-dimensional Jaccard distances.

Multiple Correspondence Analysis projection. Pronouns (blue) and determinatives (red) form regions; compound determinatives sit at the interface; reciprocals (triangles) fall in that interface zone.

Not a statistical fluke

Scramble the data 5,000 times, preserving how many properties each word has and how many words have each property. This tests whether the specific combination of features drives reciprocals’ position, not just marginal structure.

Observed pattern in only 0.6% of scrambles (p = 0.006).

Permutation null distribution. Dashed line marks the observed value.

Stable across analytic choices

Vary every reasonable analytic choice (distance metric, which properties, weighting) and show all results. Each point is one specification; Delta = mean distance to pronouns minus mean distance to determinatives.

Four metrics (Jaccard, Dice, Hamming, IDF-weighted) × two regularizations (ridge, elastic net). Positive = closer to determinatives; negative = closer to pronouns. Exception: removing morphology flips the sign.

Sign stable across most choices. Removing morphology flips it. That’s cross-dimensional tension.

Right at the midpoint

Best-fitting mixture weight: each other = 0.534, one another = 0.487

Every item sorted from determinative (0) to pronoun (1). Reciprocals sit at the midpoint.

All five expectations confirmed

Expectation Result
Stable position Result stable no matter how you measure
Cross-dimensional tension Morphology → determinative; semantics → pronoun
Clean anchors Same methods correctly identify clear cases
Near-parity mixture Best-fitting weights 0.534, 0.487 (near midpoint)
Robustness to null Pattern in only 0.6% of scrambled data


This isn’t measurement failure. It’s what a real boundary looks like.

What kind of problem is this?

Reciprocals are one or the other. But our instruments can’t resolve which.

Resolved Unresolved

Categories are internally gradient but sharply bounded. This isn’t gradience; it’s a boundary phenomenon: independent mechanisms sustaining opposed pulls.

Back to the spinning top

Property cluster
co-occurring properties

stabilizes →

← sustains

Mechanisms
causal processes


Determinative region (morphology pull)

Morphological realization rules maintain compound structure: each + other

Pronoun region (semantics pull)

Interpretive mechanisms (reference tracking, anaphoric dependencies) maintain pronoun-like behaviour


Two sets of mechanisms, two spinning tops, one boundary. Stop either spin and the tension dissolves.

How to study boundary phenomena

  1. Build comprehensive profiles (don’t cherry-pick diagnostics)
  2. Test against scrambled baselines (especially with small n)
  3. Vary specifications systematically (show all results)
  4. Calibrate against clear cases (verify known structure)
  5. Ask whether the ambiguity is stable


We asked what makes a category real. The HPC answer: maintenance. Stable ambiguity is evidence of that maintenance at work.


Paper: LingBuzz 009294 · Code: GitHub · brettreynolds.ca · brett.reynolds@humber.ca