This sounds like an entity extraction problem and fortunately twitter handles and hashtags are easily identified using a simple regular expression.
I tend to use Python code to prepare docs but this is a personal choice and Logstash or ingest pipelines are other document-enrichment tools. This question explores the same problem.
Either way, you should be OK to have a plain doc with your original text
field and use a keyword
type structured field with an array of the extracted handles or tags. If you want to remember where these handles were extracted from the text it might be an idea to use an annotated_text field instead.