Skip to content

Feature/docs#10

Open
BeardedWhale wants to merge 15 commits into
masterfrom
feature/docs
Open

Feature/docs#10
BeardedWhale wants to merge 15 commits into
masterfrom
feature/docs

Conversation

@BeardedWhale

Copy link
Copy Markdown
Contributor

Added user and developer docs

@BeardedWhale BeardedWhale requested a review from Valenzione April 29, 2020 18:48
@BeardedWhale BeardedWhale self-assigned this Apr 29, 2020

@Valenzione Valenzione left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Overall ok, doc coverage is great

Tow major points:

  1. Move API connected documentation to OpenAPI spec, it'll make dev-doc.md clearer and more concise.
  2. There is a lack of user perspective right now in user-doc.md. Overall we need to provide a clear message for each feature of hydro-vis, why we created it, and when\in which situation the user might want to use it. In the current state of doc, it's more about "What hydro-vis is" and not about "Why we did so"

keep up with good work!

Comment thread docs/user-documentation.md Outdated
Comment on lines +5 to +26
# Why to use and what it does

Visualization of embedding space of your model can bring you various insights about your data and model performance.

Embeddings are low-dimensional, learned continuous vector representations of discrete variables

embeddings can be used to:

- find nearest points (points that your model considered close to each other)
- detect domain drift
- detect data where model makes mistakes
- detect closes counterfactual - points that are close to each other but are classified by model as different

Lets see what information our service provides:

- Visualization of all production requests embeddings with various colorings:
- Colouring based on model prediction
- Colouring based on model confidence in predictions
- Colouring based on scores of your monitoring models
- Closest requests to specific request
- Closest counterfactuals to specific request
- All information about request

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This paragraph answers questions "What for embeddings can be used" and "what is hydro-vis" but still no direct, concrete answer to "Why hydro-vis"

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

About openApi, it is already here, in doc I specify it in very begining

Comment thread docs/user-documentation.md Outdated
embeddings can be used to:

- find nearest points (points that your model considered close to each other)
- detect domain drift

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

How?

Comment thread docs/user-documentation.md Outdated

- find nearest points (points that your model considered close to each other)
- detect domain drift
- detect data where model makes mistakes

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

How? (Can it really?)

Comment thread docs/user-documentation.md Outdated
- find nearest points (points that your model considered close to each other)
- detect domain drift
- detect data where model makes mistakes
- detect closes counterfactual - points that are close to each other but are classified by model as different

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Counterfactuals are calculated, not detected. Still, no clear understanding of why we might want to look at counterfactuals

Lets see what information our service provides:

- Visualization of all production requests embeddings with various colorings:
- Colouring based on model prediction

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We provide such coloring to solve which problem?

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

yes, based on returned class and confidence

Comment on lines +22 to +23
- Colouring based on model confidence in predictions
- Colouring based on scores of your monitoring models

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Same goes for these two. It's "What" but not "Why"


## 1. Create Model and Application

Create your model, which will receive some inputs and return outputs which contain field `embedding`. Embedding should be a 1 D vector. Upload your model using command `hs upload`.

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Minor comment - I'd rather use shape notation in form of tuple rather than "1 D vector".

Comment thread docs/developer-documentation.md Outdated
```


# API

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We can omit this section in this doc, since it's described thoroughly in OpenAPI spec

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, it is described but OpenAPI does not have additional information, here I add some comments when to use specific requests

Comment thread docs/developer-documentation.md Outdated

visualization_metrics - metrics that are used to evaluate how good will visualization reflect your real multidimensional data in 2D/3D plot. More on visualization metrics you can find [here](#visualization-metrics)

possible visualization metrics:

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It's better to put it into OpenAPI spec

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Comment thread docs/developer-documentation.md Outdated

Returns state of a task and result if ready

states: = ['PENDING', 'RECEIVED', 'STARTED', 'FAILURE', 'REVOKED', 'RETRY'] (Source: [Celery Docs](https://docs.celeryproject.org/en/latest/reference/celery.states.html#all-states))

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

same, put it into OpenAPI spec

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants