How To Extract PDF File Table Data Using Amazon Textract and AWS Lambda Asynchronously

How To Extract PDF File Table Data Using Amazon Textract and AWS Lambda Asynchronously

Cloud Quick Labs

2 года назад

6,655 Просмотров

Ссылки и html тэги не поддерживаются


Комментарии:

@sucuklukolboregi1691
@sucuklukolboregi1691 - 15.11.2022 01:58

well i have a question, beautiful video explaining everything btw
What does this method any differ from just calling the api?

Ответить
@balajic-bs8me
@balajic-bs8me - 18.11.2022 11:40

I am watched this video. I need one clarification bro. Python trp module which part is helped in pdf file to text extract method. Please explain python trp module bro.
Thanks in advance

Ответить
@brucespencer6042
@brucespencer6042 - 30.11.2022 08:50

I appreciate the video. Quick question, I have completed the tutorial but when I run the test, it seems like my invoke lambda isn't doing anything. I believe the only thing I needed to change was the SNS topic Arn and role on lines 6 and 7 correct? I've also added the lambda layer as well and set the trigger to one of my buckets.

Appreciate your time

Ответить
@sandeepvaderarocks
@sandeepvaderarocks - 04.12.2022 22:45

What is poc as a service?

Ответить
@bhawnagupta8687
@bhawnagupta8687 - 30.12.2022 10:00

I am getting a key error in resultTextract function. Can you help me?

Ответить
@adityakommu344
@adityakommu344 - 09.02.2023 21:52

I am getting message in flight SQS, but I don't see data in s3. what might be the issue

Ответить
@inhxuanhanh3978
@inhxuanhanh3978 - 10.02.2023 21:02

Why use sqs when sns can also trigger lambda

Ответить
@AbhishekKumar-wx3dh
@AbhishekKumar-wx3dh - 18.02.2023 10:52

Sir, thanks for the video. One question, Is it good idea to directly upload the pdf in s3 ? Can we first take the pdf in our own microserver than upload it into s3.

Ответить
@Semaj1985
@Semaj1985 - 29.03.2023 17:09

Hi..Its very useful.. thanks
Can we get output in csv format?

Ответить
@prashantmittal9772
@prashantmittal9772 - 06.07.2023 17:08

@cloudquicklabs Hi can you give more information for invoking second lambda?
its same situation for me as others, it is not getting invoked or triggered by sqs, I checked SNS, it is getting response from first lambda function.

Ответить
@Jaheer-k8h
@Jaheer-k8h - 05.08.2023 15:16

Helo sir. This process working multi page pdf also

Ответить
@meetakukde2029
@meetakukde2029 - 15.01.2024 16:27

I am getting key error in result function, can you please guide me.
error message- "records"
error type- key error

Ответить
@yogeshborkhade4653
@yogeshborkhade4653 - 25.01.2024 10:31

I am facing: Exception in GetTableFromTextractResult and error is 'NoneType' object has no attribute 'key'. Any idea what would be the possible reason ?

Ответить
@QuanNguyen-z2g
@QuanNguyen-z2g - 28.03.2024 01:52

I wish to know whether this solution would be able to extract the structured table formats embedded as [IMAGES] in PDF file . How to adapt the Lambda code to meet this requirement? I look forward to your advise. Thank you

Ответить
@QuanNguyen-z2g
@QuanNguyen-z2g - 28.03.2024 01:54

I wish to know whether this solution would be able to extract the structured table formats embedded as [IMAGES] in PDF file . It seems a bit tricky to parse the images in PDF and require more overhead. How to adapt the Lambda code to meet this requirement? I look forward to your advise. Thank you

Ответить
@nazerhussain804
@nazerhussain804 - 28.03.2024 18:27

Hi sir I have a usecase that have to read multiple page invoice details and send as json format please guide me i have watched your video but im unable ro get it properly so please guide me

Ответить
@santhoshsreshta
@santhoshsreshta - 01.04.2024 22:51

can you help me parse the table data into json format, I don't see much documentation on this as it has to be fetched based on relationship id's. ideally am looking to read multipage pdf containing multiple tables and convert these tables to json (title, header, cell footer)

Ответить