Spaces: yenniejun/tokenizers-languages

Runtime error

yenniejun committed on May 14

Commit 94a7df7 • 1 Parent(s): 6dcf310

Update app.py

Files changed (1)

app.py CHANGED

@@ -53,8 +53,8 @@ with st.sidebar:
 	st.header('All languages are NOT created (tokenized) equal!')
 	link="This project compares the tokenization length for different languages. For some tokenizers, tokenizing a message in one language may result in 10-20x more tokens than a comparable message in another language (e.g. try English vs. Burmese). This is part of a larger project of measuring inequality in NLP. See the original article: [All languages are NOT created (tokenized) equal](https://www.artfish.ai/p/all-languages-are-not-created-tokenized) on [Art Fish Intelligence](https://www.artfish.ai/)."
 	st.markdown(link)
-	st.divider()
+    st.header('Data Visualization')
 	st.subheader('Tokenizer')
 	# TODO multi-select tokenizers
 	tokenizer_name = st.sidebar.selectbox('Select tokenizer', options=tokenizer_names_to_test, label_visibility='collapsed')
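
Review note: as captured here, the added `st.header('Data Visualization')` line is indented with four spaces, while the surrounding lines inside `with st.sidebar:` are indented with tabs. This may only be an artifact of how the page was captured. If app.py really mixes the two, Python would be expected to reject the file before any Streamlit code runs. The sketch below is a hypothetical reduction of the hunk, not the actual file: it only compiles the snippet, so `st` and `link` are never resolved, and the helper name `indentation_error` is made up for illustration.

```python
# Hypothetical reduction of the hunk. The snippet is compiled but never
# executed, so streamlit does not need to be installed.
SNIPPET = (
    "with st.sidebar:\n"
    "\tst.markdown(link)\n"                  # tab-indented, like the context lines
    "    st.header('Data Visualization')\n"  # four spaces, like the added line
    "\tst.subheader('Tokenizer')\n"          # tab-indented again
)


def indentation_error(source: str):
    """Return the IndentationError raised when compiling `source`, or None."""
    try:
        compile(source, "app.py", "exec")
    except IndentationError as err:  # TabError is a subclass of IndentationError
        return err
    return None


# Mixed tabs and spaces, as shown in the diff.
err = indentation_error(SNIPPET)

# Same snippet with the added line re-indented with a tab.
fixed = indentation_error(SNIPPET.replace("    st.header", "\tst.header"))

print(type(err).__name__ if err else "compiles cleanly")
print(type(fixed).__name__ if fixed else "compiles cleanly")
```

If the mix is real, re-indenting the added line with a tab so that it matches its neighbours should be enough for the block to compile.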