Validated sensor performance, calibration data, and the peer-reviewed science behind continuous microbial water quality monitoring.
The molecule. Tryptophan is an essential aromatic amino acid—humans and animals cannot synthesize it and must obtain it through diet. Its distinctive indole side chain makes it intrinsically fluorescent: illuminated with ultraviolet light at approximately 275 nm, tryptophan re-emits at approximately 340 nm. Tryptophan-like fluorescence (TLF) is the optical water quality parameter that captures this signature, along with the closely related fluorescence of tryptophan-containing peptides and proteins.
Direct measurement, not a proxy. At the physics level, TLF is not a proxy for anything—it is a direct optical measurement of the concentration of free tryptophan, small tryptophan-containing peptides, and exposed tryptophan residues on protein surfaces in the water column. That dissolved-protein pool is itself a near-direct measure of fresh microbial biomass: free tryptophan in surface water has a half-life of days and is dominated by recent cell lysate and active microbial secretion. TLF therefore directly measures recent proteinaceous biomass loading—a fundamental water quality parameter on its own, independent of any particular species. E. coli culture counts and TLF are best understood as co-measurements of the same underlying biological reality, not as ground truth and proxy. The Lume is calibrated against CFU/100 mL because CFU is the regulatory unit, not because CFU is the more fundamental quantity; in continuous deployments TLF actually integrates total fluorescent biomass in real time, while culture-based methods sample one indicator species at a single timepoint.
Why TLF tracks E. coli. Tryptophan in water correlates strongly with E. coli for two compounding reasons. First, both share a common origin in fecal contamination: every bacterial cell is built from protein (roughly 1% of amino acid residues are tryptophan), and feces is rich in free tryptophan, tryptophan-containing peptides, and live enteric bacteria. Wherever one is elevated, the other tends to be as well. Second, E. coli uniquely expresses the enzyme tryptophanase (encoded by the tnaA gene), which cleaves tryptophan into indole, pyruvate, and ammonia—a defining metabolic trait that has been the basis of the classical Indole Test for E. coli identification for over a century. Both tryptophan and E. coli also degrade on similar timescales in the environment, so an elevated TLF signal flags recent, biologically active contamination rather than legacy organic matter.
Why fecal sources dominate the signal. Not all bacteria or organic matter contribute equally to TLF. Enteric bacteria produce tryptophan-rich outer-membrane proteins, pili, and flagellar proteins, and lyse rapidly when they leave the gut, releasing free tryptophan and small, highly fluorescent peptides into the water. Fecal plumes deliver this material at 106–108 cells per mL—orders of magnitude above the slow turnover of native environmental bacteria, which tend to be sparse, oligotrophic, and far less protein-rich per unit volume. Meanwhile, the dominant non-fecal organics in natural waters—humic and fulvic acids from soil and plant decay—fluoresce in a different region of the excitation/emission matrix (humic-like fluorescence, ~340/440 nm) and contribute little to the tryptophan channel (~275/340 nm). The combination of high-flux, tryptophan-rich biomass from feces and a chemically distinct humic background is what lets TLF act as a relatively specific marker for recent fecal contamination rather than general organic loading.
A validated real-time proxy. A positive relationship between TLF intensity and fecal indicator bacteria (FIB), particularly E. coli, has been demonstrated across groundwater, surface water, estuarine, and urban watershed contexts on multiple continents. Because TLF is a purely optical measurement—no reagents, no incubation—it delivers contamination signals in seconds rather than the 24–48 hours required by culture-based methods. The Lume builds on this foundation by coupling TLF optics with machine learning that corrects for the environmental confounders (turbidity, temperature, dissolved organic carbon) that limit static-threshold approaches.
The Virridy Lume addresses many of the limitations identified in the TLF literature by coupling optical TLF sensing with machine learning models that account for environmental confounders. Rather than relying on static TLF thresholds, the Lume's algorithm adapts to site-specific conditions.
The Lume returns a TLF reading in 60 seconds, compared to 24–48 hours for traditional culture-based methods.
The Lume's minimum detection limit for tryptophan dissolved in deionized water.
Balanced accuracy at the ≥10 CFU/100 mL threshold against CBT field reference (sensitivity 76%, specificity 93%), approaching CBT's own inter-method reproducibility ceiling.
Site-specific calibrated categorical classifications of microbial contamination risk.
Out-of-the-box accuracy on a continuous scale across 0–1,000 CFU/100 mL.
Mean absolute percentage error in log-transformed concentration space vs. culture-based methods.
Machine learning quantification: Unlike earlier TLF sensors that report relative fluorescent units, the Lume quantifies actual E. coli concentrations using gradient-boosted decision tree models that capture nonlinear patterns in noisy sensor data. The ML model was validated across freshwater and coastal environments, including saline settings where enterococci are the regulatory indicator.
Multi-parameter correction: Built-in turbidity and temperature sensors allow the algorithm to correct for environmental interference that confounds raw TLF readings.
Drinking water performance: Validated against 216 paired Compartment Bag Test (CBT) observations in Rwanda and Kenya. Binary classification at ≥10 CFU/100 mL achieves 85% balanced accuracy (sensitivity 76%, specificity 93%), approaching CBT's own inter-method reproducibility ceiling. WHO risk category agreement within ±1 tier reaches 97%.
Ocean transferability: In initial ocean deployments, the Lume achieved over 76% categorical accuracy using just six training samples, demonstrating transferability across microbial indicators and water matrices.
Adaptive contamination detection: The Lume uses machine learning to learn normal conditions and trigger contamination alerts based on learned patterns rather than fixed thresholds. (US Patent 11,506,606 B2 — Bedell, Fankhauser, Sharpe, Wilson & Thomas)
Automated system-state classification: Time-series data from water infrastructure sensors are analyzed to classify system states and support operational decisions without manual inspection or rule-based logic. (US Patent 11,507,861 B2 — Wilson, Coyle, Thomas & Croshere)
Validated against 216 paired Compartment Bag Test (CBT) observations from drinking water programs in Rwanda (Amazi Meza school filtration, ~600,000 students) and Kenya (DRIP FUNDI borehole chlorination, ~120,000 people), May–June 2026. The Lume is statistically equivalent to CBT on average (TOST, p < 0.001) within a ±0.65 log10 equivalence margin, and agrees within combined measurement uncertainty (±0.92 log10) 88% of the time. Full validation report →
100% of chlorinated samples (n = 30, all Kenya borehole post-chlorination) had 0 CFU by CBT and were correctly classified as safe by the Lume. Free chlorine attenuates tryptophan-like fluorescence (TLF), producing a measurable reduction in sensor signal that clearly distinguishes chlorinated from unchlorinated water.
216 paired sensor–CBT observations were collected across 3 Lume 1.2 sensors, with 204 used after documented exclusions (not-in-water filter, sensor faults, baseline transitions). The CBT (Aquagenx Compartment Bag Test) is a portable, field-deployable reference method with published sensitivity of 94.9% and specificity of 96.6% versus membrane filtration (Stauber et al. 2014). The estimation model is an 8-parameter right-censored linear regression (Tobit) on log10(E. coli + 1), incorporating baseline-subtracted fluorescence, water temperature, turbidity proxy, and per-sensor calibration offsets.
The Lume provides ~288 readings per day versus a single annual CBT grab sample — over 200× the temporal density. Continuous monitoring detects transient contamination events missed by periodic sampling. Gold Standard dMRV Pilot 14 (approved September 2025) authorizes Lume estimates to substitute for laboratory grab-sample requirements under SDWS Parameter 18.
Cannot reliably distinguish 0 from 1–9 CFU: 73% balanced accuracy at the ≥1 CFU threshold. The Lume should not be used to certify zero-E. coli conformity.
Per-sensor calibration required: LOOCV agreement varies by sensor (85–98%). Each new sensor requires per-sensor calibration; quarterly CBT verification recommended.
Risk category, not precise count: R² = 0.52. Validated for categorical risk classification, not exact CFU concentrations.
The Lume algorithm has been developed and extensively validated against Colilert® E. coli in freshwater systems. The platform has also been successfully deployed in saline coastal environments, where regulatory compliance is based on enterococci. In these initial ocean deployments, Lume achieved over 76% categorical accuracy using just six training samples.
How does the Lume compare to both EPA-approved laboratory methods? We analyzed paired samples measured by Colilert (IDEXX), membrane filtration (MF), and the Lume sensor across regression, Bland-Altman agreement, and categorical classification frameworks. The fourth panel additionally refits the Lume regression directly against MF, isolating the contribution of reference-method choice from sensor performance.
The Colilert-trained Lume achieves stronger agreement with Colilert (R² = 0.881, n = 209) than the two EPA-approved methods achieve with each other (R² = 0.572, n = 153).
When refit against MF as the training target, the Lume achieves equivalent quantitative performance (R² = 0.872, n = 303) — showing the sensor is method-agnostic.
Against Colilert, the Lume achieves “almost perfect” agreement (κ = 0.88), versus only “fair” agreement between the two lab methods (κ = 0.40).
The headline result: Sensor-to-reference agreement is bounded by reference-method reproducibility, not by Lume hardware. When the Colilert-trained Lume is tested against MF (a reference it was never trained on), R² drops to 0.514 — almost exactly the disagreement between the two lab methods themselves. When the Lume is instead calibrated directly against MF, performance jumps back to R² ≈ 0.87. Whichever culture method you hand the Lume as truth, it fits that method as well as or better than the two EPA-approved methods fit each other.
Microbial water quality is most frequently assessed using E. coli as a risk indicator. However, relying exclusively on culture-based E. coli measurement is limiting: it is slow, expensive, requires trained personnel, and captures only a single point in time. Risk assessments can be significantly improved by integrating TLF as a complementary, continuously monitored parameter.
TLF sensing is purely optical. No consumables, reagents, or lab infrastructure needed for each measurement.
In Kenya groundwater studies, TLF sampling showed 14% average relative percent difference between duplicates, compared to ≥26% for culture-based methods.
Sensors can operate autonomously for months, capturing short-duration contamination events that weekly grab samples miss.
Research in Malawi showed TLF indicates broader contamination risk than microbial culturing, making it a useful high-level screening tool.
For the cost of a single lab-processed grab sample, a TLF sensor can deliver thousands of in-situ microbial estimates over weeks to months of continuous deployment.
TLF has been validated across groundwater, freshwater rivers, chlorinated piped systems, and saline coastal environments on multiple continents, with studies spanning Kenya, Malawi, the US, and France.
TLF cannot be used as a direct proxy for E. coli on an individual sample basis. The TLF signal can be influenced by dissolved organic carbon (DOC), humic-like fluorescence (HLF), turbidity, and temperature. Standardization of TLF thresholds associated with different risk levels remains an active area of research. Performance is best in groundwater, which typically has low DOC, consistent temperature, and negligible turbidity.
Sorensen et al. demonstrate in-situ tryptophan-like fluorescence as a real-time indicator of faecal contamination in drinking water supplies. (Water Research)
Sorensen et al. establish real-time detection of faecally contaminated drinking water with TLF, defining threshold values. (Sci. Total Environ.)
Nowicki et al. evaluate TLF as a measure of microbial contamination risk in Kenyan groundwater across 37 water points. (Sci. Total Environ.)
Bedell, Sharpe, Purvis, Brown & Thomas demonstrate low-cost TLF sensor concepts for fecal exposure detection. (Sustainability) • Virridy
Ward et al. conduct a nine-month monitoring program in Malawi, finding TLF is a more precautionary risk indicator. (Sci. Total Environ.)
Bedell, Harmon, Fankhauser, Shivers & Thomas field-validate a continuous in-situ fluorescence sensor coupled with ML. 83% accuracy. (Water Research) • Virridy
Multiple groups evaluate TLF for combined sewer overflow watersheds and estuarine systems. (Sci. Total Environ.; ACS ES&T Water)
Knopp, Klaus, Wilson et al. advance continuous in-situ quantification with Lume V1.2 sensor design and multi-site validation. (Water Research) • Virridy
Demaree, Fankhauser, Cole, Ross & Thomas develop sensor-informed predictive models for TOC and nutrients on the Upper Yampa River. (ES&T Water) • Virridy