Processing math: 100%
+ - 0:00:00
Notes for current slide
Notes for next slide


EMBRACING THE BLESSING OF DIMENSIONALITY TO DETERMINE SPECIES’ PROVENANCE





Kevin Cazelles   KCazelles

Emelia Myles-Gonzalez, Tyler Zemlak, Kevin S. McCann @McCannLab



QCBS Annual symposium, Montreal

December 18th, 2019

KevCaz/fightingNoise

1 / 64

Context


The push for provenance

2 / 64

The push for provenance


Consumers are become increasingly aware of, and interested in, the origins of their seafood, particularly as issues such as environmental sustainability, impacts on endangered species, toxin accumulations, incidents of illegal, unregulated and unreported (IUU) fishing, quality assurances and human rights abuses are better understood.

Roebuck, K. et al. (2017) Canadian's Eating in the Dark: A Report Card of International Seafood Labelling Requirements

3 / 64

Sockeye salmon (Oncorhynchus nerka)

Crawford, S.S. & Muir, A.M. (2008). Global introductions of salmon and trout in the genus Oncorhynchus: 1870–2007. Reviews in Fish Biology and Fisheries.

5 / 64

The push for provenance


1. We need regulations

e.g. Country of Origin Labelling (COOL) in USA


6 / 64

The push for provenance


1. We need regulations

e.g. Country of Origin Labelling (COOL) in USA


2. We need tools to authenticate the provenance

What this talk is about!

7 / 64

Biotracers to create spatial fingerprints


Bio + tracer

  - trace elements

  - stable isotopes

  - fatty acids

  - DNA

  - gut contents

  - etc.

8 / 64

Vertical vs horizontal strategies


The future of food authentication and food quality assurance critically depends on combining chemometrics, computational analytical methods, and bioinformatics in processing and interpreting the data obtained through analytical technique.

Danezis, G. P., et al. (2016) Food authentication: state of the art and prospects. Current Opinion in Food Science


9 / 64

Why develop horizontal strategies?


Chen, Dong, et al. (2013) Blessing of Dimensionality: High-Dimensional Feature and Its Efficient Compression for Face Verification. 2013 IEEE Conference on Computer Vision and Pattern Recognition.

10 / 64

A Bayesian framework

12 / 64

Where does this sample come from?

13 / 64

Where does this sample come from?

14 / 64

A Naive Bayes classifier

15 / 64

A Naive Bayes classifier


- A priori knowledge

- Sample (observations)


16 / 64

A Naive Bayes classifier


- A priori knowledge

- Sample (observations)


- Quantity, quality, heterogeneity


17 / 64

A Naive Bayes classifier


- A priori knowledge

- Sample (observations)


- Quantity, quality, heterogeneity


- Statistical model

18 / 64

Origins and distributions


19 / 64

Where does this sample come from?

20 / 64

Problem

21 / 64

Problem

O1 | Sample ?

22 / 64

Problem

O1 | Sample ?

Sample | O1    &    Sample | O2    known!

23 / 64

A bayesian approach



Assumptions

1. sample from one of the areas considered

2. no mixture


[O1|Sn]=[Sn|O1][O1]j[Sn|Oj][Oj]

24 / 64

Similarity & authentication


25 / 64

Similarity & authentication


26 / 64

Similarity & authentication


27 / 64

Similarity & authentication


28 / 64

Similarity & authentication


29 / 64

Similarity & authentication


30 / 64

Similarity & authentication


31 / 64

Dimensionality & authentication


Assumption: Bio-tracers ~ N(μ, Σ)

just as in a Linear Discriminant Analysis (LDA)


32 / 64

Dimensionality & authentication


Assumption: Bio-tracers ~ N(μ, Σ)

just as in a Linear Discriminant Analysis (LDA)


μ: means - fixed

Σ: covariance matrix

diagonal (variances) fixed

off-diagonal terms (symmetric) vary:

33 / 64

Dimensionality & authentication

34 / 64

Dimensionality & authentication

35 / 64

Dimensionality & authentication

36 / 64

Dimensionality & authentication

37 / 64

Dimensionality & authentication

38 / 64

Dimensionality & authentication

39 / 64

Dimensionality & authentication

40 / 64

Dimensionality & authentication

41 / 64

Dimensionality & authentication

42 / 64

Dimensionality & authentication

43 / 64

Dimensionality & authentication

44 / 64

Dimensionality & authentication

45 / 64

Dimensionality & authentication

46 / 64

Dimensionality & authentication

Blessing of dimensionality!

47 / 64

So



48 / 64

So



1. the more dissimilar the distributions the better

2. the higher the dimensionality the better


49 / 64

So



1. the more dissimilar the distributions the better

2. the higher the dimensionality the better


3. Assuming extra bio-tracers enhance 1. or 2. (or both)

50 / 64

So



1. the more dissimilar the distributions the better

2. the higher the dimensionality the better


3. Assuming extra bio-tracers enhance 1. or 2. (or both)


THE MORE BIO-TRACERS THE BETTER

51 / 64

Using the framework

52 / 64

Data


- Sockeye Salmon

- 3 regions

- 30 individuals/region

- bio-tracers:

     3 stable isotopes

     14 fatty acids

53 / 64

How?


- A priori knowledge 20 individuals | test on 10 individuals

- All combinations of bio-tracers

- Custom Bayes classifier / LDA / Deep Learning


54 / 64

How?


- A priori knowledge 20 individuals | test on 10 individuals

- All combinations of bio-tracers

- Custom Bayes classifier / LDA / Deep Learning


cor(RU, US) > cor(CA, US) = cor(CA, RU)

55 / 64

Results - increasing sample size

3 bio-tracers

56 / 64

Results - increasing dimensionality

sample size = 1

57 / 64

Perspectives

58 / 64

Perspectives


Limits

Unpack results about correlation structure

Mixture problem

Temporal variations

59 / 64

Perspectives


Future

Let's think vertical

Can we use this to create reliable maps of probabilities of origin?

60 / 64

Many thanks to



61 / 64

Many thanks to



My co-authors and the McCann Lab



62 / 64

Many thanks to



My co-authors and the McCann Lab



Food From Thought

63 / 64

THE END

64 / 64

Context


The push for provenance

2 / 64
Paused

Help

Keyboard shortcuts

, , Pg Up, k Go to previous slide
, , Pg Dn, Space, j Go to next slide
Home Go to first slide
End Go to last slide
Number + Return Go to specific slide
b / m / f Toggle blackout / mirrored / fullscreen mode
c Clone slideshow
p Toggle presenter mode
t Restart the presentation timer
?, h Toggle this help
Esc Back to slideshow