How to create tokens from the text data

Pratik_Hulji · December 23, 2019, 8:55am

Hi guys, I have converted the pdf into base64 format and now i want to convert the text data into tokens. Can you please guide me.

dadoonet · December 23, 2019, 9:12am

What is the use case?

Pratik_Hulji · December 23, 2019, 9:15am

I have to extract the data from the resume

dadoonet · December 23, 2019, 9:27am

What do you want to do with the extracted data?

Pratik_Hulji · December 23, 2019, 9:31am

This is the small task assigned to me. Further it will be used for analysis by the other team.

dadoonet · December 23, 2019, 1:46pm

What kind of analysis?

Anyway, you can use the _analyze API and use its output. With that you might be able to do the rest.

Pratik_Hulji · December 24, 2019, 5:51am

Yes, I used _analyze it takes this two parameters "tokenizer": "", "text": "".
But my approach is different.

dadoonet · December 24, 2019, 6:12am

Yes. That's what I wrote. Use the _analyze API to do step 2.

Pratik_Hulji · December 24, 2019, 6:19am

Okay Thank you

system · January 21, 2020, 6:19am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.